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Preface 



The Second International Workshop on Automated Deduction in Geometry 
(ADG ’98) was held in Beijing, Ghina, August 1-3, 1998. An increase of interest 
in ADG ’98 over the previous workshop ADG ’96 is represented by the notable 
number of more than 40 participants from ten countries and the strong techni- 
cal program of 25 presentations, of which two one-hour invited talks were given 
by Professors Wen-tsiin Wu and Jing-Zhong Zhang. The workshop provided the 
participants with a well-focused forum for effective exchange of new ideas and 
timely report of research progress. Insight surveys, algorithmic developments, 
and applications in GAGD/GAD and computer vision presented by active re- 
searchers, together with geometry software demos, shed light on the features of 
this second workshop. ADG ’98 was hosted by the Mathematics Mechanization 
Research Genter (MMRG) with financial support from the Ghinese Academy of 
Sciences and the French National Genter for Scientific Research (GNRS), and 
was organized by the three co-editors of this proceedings volume. 

The papers contained in the volume were selected, under a strict refereeing 
procedure, from those presented at ADG ’98 and submitted afterwards. Most of 
the 14 accepted papers were carefully revised and some of the revised versions 
were checked again by external reviewers. We hope that these papers cover some 
of the most recent and significant research results and developments and reflect 
the current state-of-the-art of ADG. 

We want to thank all those members of MMRG who contributed to the 
local organization of ADG ’98 and the referees for their time and help. We also 
want to acknowledge some of the previous workshops and conferences and the 
corresponding publications listed below which have stimulated the continuing 
activities on ADG. 

— International Workshop on Automated Deduction in Geometry (ADG ’96) 
(Toulouse, France, September 27-29, 1996) — LNAI, vol. 1360 (edited by 
D. Wang), Springer- Verlag, Berlin Heidelberg New York, 1997 

— Workshop on Algebraic Approaches to Geometric Reasoning (Gastle of Wein- 
berg, Austria, August 17-19, 1992) — Special issue of Annals of Mathematics 
and Artificial Intelligence, vol. 13, nos. 1-2 (edited by H. Hong, D. Wang 
and F. Winkler), Baltzer Sci. Publ., Amsterdam, 1995 

— International Workshop on Mathematics Mechanization (Beijing, Ghina, 
July 16-18, 1992) — Proceedings (edited by W.-T. Wu and M.-D. Gheng), 
Int. Academic Publ., Beijing, 1992 

— Workshop on Gomputer-Aided Geometric Reasoning (INRIA Sophia-Anti- 
polis, June 22-26, 1987) — Proceedings (edited by H. Grapo), INRIA Roc- 
quencourt, France, 1987 
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~ International Workshop on Geometric Reasoning (Oxford, UK, June 30 ~ 
July 3, 1986) — Special issue of Artificial Intelligence, vol. 37, nos. 1-3 
(edited by D. Kapur and J. L. Mundy), MIT Press, Cambridge, 1988 
— Asian Symposia on Computer Mathematics: ASCM ’98 (Lanzhou, China, 
August 6-8, 1998) — Proceedings (edited by Z. Li), Lanzhou Univ. Press, 
Lanzhou, 1998; ASCM ’96 (Kobe, Japan, August 20-22, 1996) — Proceed- 
ings (edited by H. Kobayashi), Scientists Inc., Tokyo, 1996; ASCM ’95 (Bei- 
jing, China, August 18-20, 1995) — Proceedings (edited by H. Shi and 
H. Kobayashi), Scientists Inc., Tokyo, 1995 
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Automatic Geometry Theorem-Proving and 
Automatic Geometry Problem-Solving 



Wen-tsiin Wu 

Institute of Systems Science, Academia Sinica 
Beijing 100080, P. R. China 
wtwu@mmrc . iss .ac.cn 



1 Principle and Method 

Studies in geometry involve both theorem-proving and problem-solving. So far 
the latter is concerned, we may cite for example geometrical locus-determination 
and geometrical constructions by ruler-compass which were much studied in 
ancient Greece. In particular we may mention the regular polygon construction 
and the three famous difficult problems of angle-trisection, cube-duplication and 
circle-squaring. 

A turning point of geometrical studies occurred in 1637 when R. Descartes 
published his classic on geometry. The introduction of coordinate system turned 
then geometrical studies to algebraic ones in replacing pure geometrical reason- 
ings by manipulation of polynomials and solving of polynomial equations. This 
paved the way to the final solutions of the geometrical-construction problems 
mentioned above. 

Our method of manipulating polynomials and solving arbitrary systems of 
polynomial equations in the form of Well-Ordering Principle and Zero-Decompo- 
sition Theorems permits to treat geometrical studies even in an automated way. 
The germ of such studies as well as their wide-range applications lies in our gen- 
eral method of polynomial equations-solving. To be precise, let the unknowns of 
the polynomials in question be definite ones forming a finite set X and their coef- 
ficients be in a fixed field K of characteristic 0. By a polynomial set (abbr. polset) 
PS C K[A] we shall mean one consisting of a finite number of non-zero poly- 
nomials in K[A] unless otherwise stated. For such a polset PS the solving of 
equations PS = 0 is then equivalent to the determination of the set Zero(PS') 
consisting of the totality of solutions of PS = 0 or zeros of PS in an arbitrary 
extension field of K. For any polynomial G € K[A'], we shall denote the set- 
difference Zero(PS') \ Zero(G) by Zero{PS/G). We remark in passing that the 
set Zero(PS') is nothing else but an affine algebraic variety, irreducible or not, 
in the usual sense. Now the difficulty of solving polynomial equations arises in 
essence from the fact that the polynomials in question are in general given in 
confused order. The key point to overcome such difficulties is thus: The given 
polset should be so manipulated in a way that the resulting re-arranged poly- 
nomials will be in some well-ordered form for which the corresponding solutions 
are almost self-evident and are closely related to those of the original ones to 
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be sought for. In fact, in the case where all polynomials are linear in the un- 
knowns X, the usual elimination method (say the Gaussian one) is just such a 
method which, after due manipulations, will lead to a set of well-ordered linear 
forms from which the solutions of the corresponding equations are immediately 
given. 

With this understood we shall introduce a well-ordering procedure of poly- 
nomials and polsets in the following way. 

1. Ordering of Unknowns. First of all we shall arrange the unknowns X in 
an arbitrary but fixed order, denoted henceforth hy xi,X 2 , ■ ■ ■ ,Xn in the natural 
order 

Xi X2 Xn- 

2. Partial Ordering of Polynomials. For a non-constant polynomial P e 
K[X] let Xc be the unknown actually occurring in P with greatest subscript c. 
Then P may be written in the canonical form below: 

P = Fa;)? -|- Jix'^ ^ -|- ■ ■ ■ -l- Jd-iXc + Jd, 

in which the coefficients I, Ji are themselves polynomials in K[a;i, . . . , a;c-i] with 
1^0. We call c (> 0) the class of P, d (> 1) the degree of P, I (yf 0) the initial 
of P, and the formal partial derivative dP/dxc the separant of P. In case P is 
a non-zero constant in K, we set the class of P to be 0, and leave the degree 
etc. undefined. We now introduce a partial ordering for polynomials in K[X] 
first according to the class, and then to the degree. 

3. Partial Ordering of Polsets of Particular Form. Consider now a polset 
AS of particular form in which the polynomials may be arranged in an ascending 
order: 

Ai ^ A.2 ^ Aj.- (1) 

We suppose that either r = 1 and Ai is a non-zero constant in K, or r > 1 with 
corresponding classes Ci of Ai steadily increasing: 

0 < Cl < ■ ■ ■ < Cr- 

For technical reasons we shall require that for each pair (i,j) with 0 < j < i, 
the degree of Xc^ in the initial li of Ai should be less than the degree of Aj. 
Under such conditions the polset AS = {Ai,...,Ar} is called an ascending 
set (abbr. asc-set). The product of all the initials li (resp. all the initials and 
separants) will be called the initial-product (resp. the initial-separant-product) 
of AcS. For any polynomial P G K[X], we shall get by successive divisions with 
respect to Ar, A 2 , Ai a unique remainder R given in the form 

R = I{^ ■■■PpP-Y,Q^A,, 

i 

with Qi € K[X], Si (> 0) as small as possible, so that R is reduced with respect 
to AS in the following sense: the degree of Xa in R is less than that of Ai for 
each i with 0 < i < r. This remainder R will be denoted as Remdr(P/AS'). 
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A partial ordering for all asc-sets is now introduced which we shall not enter. 
A consequence of the ordering is the following critical theorem: 

A sequence of asc-sets steadily decreasing in order is necessarily finite. 

4. Partial Ordering of Polsets. For any non-empty polset PS C K[A] let 
us consider asc-sets contained in PS which necessarily exist. Any such asc-set 
of lowest ordering is called a basic-set of PS. We may then unambiguously in- 
troduce a partial ordering for all polsets according to their basic-sets. Again a 
critical theorem says that, the ordering of a polset will be lowered by adjoining 
to it a polynomial reduced with respect to any basic-set of the given polset. 
Furthermore, as mentioned above, such reduced polynomials may be gotten by 
forming remainders of polynomials with respect to such basic-sets. 

Our method of solving polynomial equations is based on the following obser- 
vation. 

Let A? be an asc-set as in (1). For r = 1 and Ai a non-zero constant in K 
or Cl = 0 it is clear that Zero(AS') = 0. In the general case Zero(AS') may also be 
considered as well-determined in the following sense. Let U = {xj \ j ^ any Ci} 
be considered as an arbitrary parameter set. From Ai = 0 each x^ may then 
successively be considered as a well-defined algebraic function of Xk with k < Ci. 
For given definite values in K of U, we may solve Ai = 0 successively for x^. 
Hence in any case Zero(AS'), and Zero(AS'/G) too for any G G K[A], may be 
considered as well-determined. 

With the above understood our underlying principle and method of polyno- 
mial equations-solving consist in determining an asc-set AS from a given polset 
PS such that Zero(AS') is closely related to Zero(PS'), which will give at least 
a partial determination of an important part of Zero(PS') and serves to lead 
to a complete determination of Zero(PS'). By the critical theorems described 
above such an asc-set may be gotten by successively enlarging the given polset 
in adjoining remainders with respect to some relevant asc-sets. 

The whole procedure of getting such an asc-set from a polset PS may be 
given in the form of a concise diagram below: 

PS= PSo PSi ■■■ PS^ ■■■ PSm 

BSo BSi ■■■ BS, ■■■ BSm = CS (2) 

RSo RSi ■■■ PS, ■■■ PSm = 0- 

In the scheme (2) all the remainder sets PSk are non-empty for k < m and we 
have put 

BSi = a basic set of PSi, 

PSi = {Remdr{P/BSi) \ P £ PSi\ BSi}, 

PS, = PSo U BS,-i U PS,-i. 

From the critical theorems the basic-sets BSi will be steadily decreasing in order 
so that the procedure will end at some stage m with final PSm = 0- The final 
basic-set BSm = CS is then the asc-set as required and is called a characteristic 
set (abbr. char-set) of the given polset PS. The importance of the notion of 
char-set may be seen from the following theorem. 
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Well- Ordering Principle. For the char-set CS of PS let IP (or ISP) be the 
initial-product (or the initial-separant-product) of the asc-set CS. Then 

Zero(PS') = Zero{CS/IP) (J Zero{PS U {IP}), or (3) 

Zero(PS’) = Zero{CS/ISP) |J Zero(P5 U {/SP}). (4) 

By reasoning about PS U {IP} (or PS U {/SP}) in the same manner as PS 
further and further we get finally the following theorems. 

Zero-Decomposition Theorem. From a given polset PS one may determine 
in an algorithmic way a set of asc-sets CSi (or CS}) with initial-products IPi (or 
initial-separant-products ISP}) such that 



Zero(PS') = \^Zero{ASi/ IPi), or 

i 


(5) 


Zero(PS') = |JZero(^'//SP'). 


(6) 



j 



Define an asc-set to be irreducible in some natural way; then we may refine 
the above decomposition theorem to the following form: 

Variety-Decomposition Theorem. From a given polset PS one may deter- 
mine in an algorithmic way a set of irreducible asc-sets IRRk such that for some 
so-called Chow basis CBk C K[V] which may be determined algorithmically 
from IRRk, we have 

Zero(PS) = lJVar[/i?Rfc], (7) 

k 

in which Var[/i?Rfc] = Zero(CPfc) C Zero{IRRk). 

These theorems furnish us a general method (to be called the char-set 
method) for polynomial equations-solving. It serves also as the basis of appli- 
cations to various kinds of problems arising from science and technology, besides 
those from mathematics itself. For more information we refer to relevant pa- 
pers due to various authors mainly at MMRC as well as a forthcoming book by 
the present author. The book is entitled {{Mathematics Mechanization: Geom- 
etry Theorem- Proving, Geometry Problem-Solving, and Polynomial Equations- 
Solving » which will be published by Science Press, Beijing. 

We have to point out that the idea of introducing ordering of unknowns 
and polynomials as well as the elimination procedure for solving polynomial 
equations were both originated in works of mathematicians in ancient China; see 
Section 3 below. In following the line of thought of our ancestors and applying 
some techniques of modern mathematics mainly due to J. F. Ritt we arrive 
finally at the Zero-Decomposition Theorems which settle completely the problem 
of polynomial equations-solving at least in some theoretical sense. We remark 
also that the above is not the only way of ordering polynomials and polsets. In 
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fact, in introducing partial orderings in some other manner, working in some 
way as described in the Riquier-Janet theory on partial differential equations, 
and proceeding somewhat parallel to the case of the char-set method, we arrive 
finally at some notion which is equivalent to the well-known Grobner basis of an 
ideal. For more details we refer again to the writings mentioned above. 

2 Applications and Examples 

Among the applications which have been made in recent years by myself and my 
colleagues notably at MMRC we may mention in particular the following: 

— Automated Geometrical Constructions. 

— Automated Determination of Geometrical Loci. 

— Automated Discovering of Unknown Relations. 

— Automated Design of Mechanisms. 

~ Automated CAGD (Computer-Aided Geometric Design). 

— Automated Geometry Theorem-Proving and -Discovering. 

~ Automated Proving and Discovering of Inequalities. 

— Automated Global Optimization. 

For brevity we shall consider below the application to automated geometry 
theorem-proving and thus also automated geometry theorem-discovering alone 
in more details. 

The creation of coordinate geometry turned the Euclidean fashion of proving 
geometry theorems one-by-one by logical reasonings into the Cartesian fashion 
of proving classes of geometry theorems as a whole by algebraic computations. 
However, it is not evident at all whether a geometric statement, in whatever ge- 
ometry under an axiomatic system. Euclidean, non-Euclidean, affine, projective, 
etc., may be turned into polynomial equations or inequations (and eventually 
also inequalities). It is true that this is indeed the case for most of the ordi- 
nary geometries we are well-acquainted with. However, the proof of such a fact 
is not trivial at all. On the other hand, in the hands of geometers notably in 
the 19th century, the realm of geometry has been so broadened that we may 
develop various kinds of geometries directly from number systems without any 
reference to axiomatic systems, e.g. the Pliickerian line geometry and geometries 
in the sense of F. Klein. Even more, for differential geometries, one starts usually 
in introducing curves, surfaces, and entities of geometrical interest directly by 
equations. The present author is even ignorant of any axiomatic systems over 
which some kind of differential geometry is relied upon. 

This being understood we shall restrict ourselves to such geometry theorems 
for which the hypothesis and conclusion are both expressible by equations and 
inequations (eventually also inequalities), all in the form of polynomial ones, 
leaving aside the possible underlying geometries and axiomatic systems. For 
further simplification, we shall restrict ourselves to theorems of the form T = 
{HYP, cove}, for which the hypothesis and the conclusion are respectively given 
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by the equations HYP = 0 and CONC = 0 with HYP C K[X],OJNC e K[X], K 
being a field of characteristic 0 and X = {x\, . . . ,Xn} as before. 

The basic principle for proving theorems of the above type is as follows. 

Each zero X^ in Zero{HYP) is a geometrical configuration in eventually an 
extended affine space K'", with coordinates X, K' being an extension field of K, 
which verifies the hypothesis of the theorem T. The conclusion is true for such 
a geometrical configuration if CONC = 0 at the corresponding zero X^. Proving 
a geometrical theorem thus amounts to seeking for some significant part P of 
Zero{HYP) such that CONC = 0 on that part P. The theorems on zero-set 
structures of Zero{HYP) which give us methods of polynomial equations-solving 
will serve to offer means of finding such part P and to verify whether CONC = 0 
on P. In this sense our method of geometry theorem-proving may be considered 
as a mere application of our general method of polynomial equations-solving. 

Each of the theorems for polynomial equations-solving will thus lead to a 
principle and a method of automated geometry theorem-proving. 

Well-Ordering Principle (3) Principle-Method I. For a given theorem 
T = {HYP, CONC} let CS be a char-set of the hypothesis polset HYP and IP be 
the initial-product of CS. Then 

Remdr((X»7VC'/CS') = 0 (8) 



is a sufficient condition for 

Zero{HYP/IP) C Zero(COAU). 

In other words (8) is a sufficient condition for the theorem T to be true so far 

IP 7^ 0. (9) 

Moreover, the condition (8) is also a necessary one for the above to be true in 
case CS is an irreducible asc-set. 

Zero-Decomposition Theorem (5) Principle-Method II. For a given 
theorem T = {HYP, CONC} let {^i} be a set of asc-sets determined from the 
hypothesis polset HYP such that 

ZeTo(HYP) =[jZeTo{ASjm), 



in which each IPi is the initial-product of the corresponding asc-set j^i. If 

Remdi {CONC /J^i) = 0, (10) 

then the theorem T will be true on that part Zero{ASi / IPi) of Zero{HYP) or 

Zero{ASjm) C Zero(CCWC'). (11) 

Conversely, if ASi is irreducible and the theorem T is true on the part 
Zero(AS'i /IPi) or (11) is true, then we have (10). 
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Variety-Decomposition Theorem (7) Principle-Method III. For a 

given theorem T = {HYP.CDNC} let {IRRk} be a set of irreducible asc-sets 
determined from the hypothesis polset HYP such that 

Zero(iiYP) = |J Var[/M?fc]. 

k 

Then the theorem T will be true on that irreducible component Ya,r[IRRk] if and 
only if 

Remdr {CONC/IRRk) = 0. (12) 

We add some remarks below. 

In Principle-Method I the condition (9) is usually inevitable for the theorem 
T = {HYP, (JONC{ in question to be true. In that case we shall say that the 
theorem T is generically true under the non- degeneracy conditions (9). 

For theorems of form T = {HYP, CONC{ Principles I-III give three different 
methods of geometry theorem-proving. Principle I is the simplest to apply in that 
geometrically uninteresting or even meaningless degeneracy cases IP = 0 have 
been deprived off at the outset, though somewhat incomplete in that theorems 
will only be proved in the generic sense. The method in Principle III is the 
most complete in that no non-degeneracy conditions will be involved but it is 
not so easy to apply. Remark however that it requires only the simple algebraic 
verification of (12), but not the usual difficult determination of the Chow basis. 

We have restricted ourselves to the case of K being of characteristic 0. How- 
ever, so far theorem-proving is concerned, the proving of theorems in finite ge- 
ometries for which the basic field K is of prime characteristic is of high interest 
in itself. The more so since finite geometries are closely related to such techno- 
logical problems in coding theory and cryptography. We shall nonetheless leave 
this topic in referring only to some interesting original papers of D. Lin and 
Z.-J. Liu. 

In the above we have also restricted ourselves to theorems involving equations 
and inequations alone. However, in the case of K being the real field R, theo- 
rems involving inequalities are of high interest in themselves. It is well-known 
that by A. Seidenberg’s technique we can reduce such theorems to those involv- 
ing equations and inequations alone by introducing new unknowns if necessary. 
In appearance the above Principle-Methods may still seems to be applicable. 
However, the actual carrying-out becomes so intricate that only very limited 
success has ever been achieved. Therefore we shall satisfy ourselves by stating 
only the interesting theorem below. 

8-Triads Theorem. From 8 mutually different points Ai, i = 1, ... ,8, in cir- 
cular order with Ai+g = Ai, there may be formed 8 triads Ti = AiAi^iAi^^, 
i = 1, . . . , 8. If all of these triads are collinear, then generically all the 8 points 
will lie on the same line. 

Our methods permit to give the non-degeneracy conditions automatically 
and explicitly. These conditions are however so intricate to have any evident 
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geometrical meaning and to be foreseen. In the case of complex affine plane 
there are thus oo® 8-triad configurations such that all the points Ai are different 
from each other and all the 8 triads AiAi^iAi +3 are collinear but A 1 A 2 • • • Ag 
are not on the same line. The situation will however be entirely different if 
we restrict the configurations to be in the real affine plane. In fact S.-C. Chou, 
X.-S. Gao and others have proved the following beautiful theorem and discovered 
all the possible degenerate 8-triad configurations in the real affine plane. 

Theorem. In the real affine plane there exist no 8-triad configurations for 
which Ai (i = 1 , . . . , 8 ) are different from each other and all triads AiAi+iAi +3 
are collinear while A 1 A 2 • ■ ■ Ag, are not on the same line. 

Finally we remark that all the above are possible owing to the introduction 
of coordinate systems so that our methods of polynomial manipulations become 
available. There are so many beautiful theorems which cannot even be stated 
without the intervention of coordinate systems. Among these we may cite, for 
example, the syzygy-theorems of triads of points on a complex planar cubic curve 
and the 27-line theorem of cubic surface in a complex projective space. Remark 
that the last one has been considered as one of the gems hidden in the rag-bag of 
projective-geometry theorems (quite beautiful, however) discovered in the 19th 
century. 

The study of geometrical problems, including in particular problems of geo- 
metrical constructions and geometrical locus-determinations, may also in general 
be reduced to polynomial equations-solving in introducing suitable coordinate 
systems. Besides those problems in ancient Greek already mentioned at the be- 
ginning of the present paper as well as the classical ones like the Appolonius 
Problem and the Malfatti Problem, we shall mention a few further problems 
below which have been notably studied by members of MMRG. 

Zassenhaus Problem. Given 3 bisectors, whether interior or exterior, of the 3 
angles of a triangle, construct the triangle. 

It is trivial to construct bisectors of angles of a triangle by ruler-compass 
alone. The converse, as stated in the above Zassenhaus Problem, is not an easy 
matter. In fact, X.-S. Gao and D.-K. Wang gave the answer in negative in that 
such triangles cannot be constructed by ruler-compass alone. The proof consists 
in showing that the side-squares of the triangle should satisfy some irreducible 
equations of degree 10 with coefficients in the given bisectors. 

Erdos Problem. Given in the plane 10 points Ay {i,j = 1,...,5; i < j), 
determine 5 points Bi, i = 1, . . . , 5, such that for each of the above pair (i, j). Ay 
is on the line BiBj. 

The final answer due to H. Shi and H. Li is that there are generically 6 such 
quintuples of points Bi which can be explicitly determined from the given Ay . 

Pyramid Problem. Given a regular pyramid of square base, determine planes 
which will cut the pyramid into a regular pentagon. 
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The problem was solved by D.-K. Wang in giving a necessary and sufficient 
condition for the pyramid to have such regular-pentagonal cuts. Such planar cuts, 
in case existent, may be explicitly constructed. Moreover, in that case there will 
be regular-star cuts too which may also be explicitly constructed. 

We remark that, all the problems above are solved by our methods of polyno- 
mial equations-solving, after some suitable coordinate system has been chosen. 
Clearly an immense number of such problems may be treated in the same man- 
ner. In fact, various problems arising from partial differential equations, celestial 
mechanics, theoretical physics, and technologies have been successfully treated 
in this way. 

So far we have not mentioned at all the theorems (4) and (6) involving 
separants. In fact, the concept of separant plays an important role in studying 
singularities of algebraic varieties. Moreover, based on theorem (6), the present 
author has shown that for optimum problems with real object function and real 
restraint conditions all polynomial ones, there is some finite set of real values, 
whose greatest or least value is just the greatest or least value of the original 
optimum problem, so far it is supposed to exist. This gives in particular a general 
method of solving problems in non-linear programming and proving a lot of 
inequalities, algebraic, geometric, trigonometric, etc. For more details we refer 
to relevant papers and a forthcoming book of the present author. 

3 Some Historical Discussions 

From the above we see that both geometrical theorem-proving and geometrical 
problem-solving may be reduced to polynomial equations-solving which paves the 
way to the grandiose state of wide applications. We owe this much to the classic 
of Descartes in 1637. In his classic Descartes emphasized geometry problem- 
solving instead of geometry theorem-proving and showed how such geometry 
problem-solving may be reduced to polynomial equations-solving. Moreover, in 
a posthumous work Descartes had even the idea of reducing any kind of problem- 
solving, not merely geometrical ones, to the solving of polynomial equations in 
the form of some general scheme. Below we shall reproduce some paragraphs 
from a classic of G. Polya, viz. {{Mathematical Discovery)) . In that classic is 
described such a rough outline of Cartesian scheme: 

I First, reduce any kind of problem to a mathematical problem. 

Second, reduce any kind of a mathematical problem to a problem of algebra. 

Third, reduce any problem of algebra to the solution of a single equation. ] 

The third step may be further clarified as shown in the paragraph below 
which we again reproduce from the above classic of Polya: 

I Descartes advises us to set up as many equations as there are unknowns. 
Let n stand for the number of unknowns, and xi, 2 : 2 , . . . , in for the unknowns 
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themselves; then we can write the desired system of equations in the form 

ri(a;i,a; 2 , . . . ,a;„) = 0, 

r2{Xi,X2, ...,Xn)=0, 
rn{xi,X2, ...,Xn)=0, 

where ri(xi, 2:2, . . • , x„) indicates a polynomial in xi,X2, ■ ■ ■ , Xn, and the left- 
hand sides of the following equations must be similarly interpreted. Descartes 
advises us further to reduce this system of equations to one final equation. ] 

It seems that Descartes was absolutely absurd in reducing any kind of prob- 
lem to one of polynomial equations-solving. In Descartes’ time differential cal- 
culus had not yet been discovered so it is natural that Descartes had no idea 
about differential equations and other kinds of equations but the polynomial 
ones. However, even a problem can be so reduced to polynomial equations, the 
number of equations may not be the same as the number of unknowns: it may be 
smaller or greater. Even the numbers of equations and unknowns are the same, 
how to reduce such a system of polynomial equations to the solving of equa- 
tions involving a single unknown is not at all evident and is actually entirely 
non-trivial. Even if it can be so reduced, there may occur redundant solutions 
or loss of solutions. As pointed out by Polya in the above-mentioned classic, the 
reduced equations in single unknowns may furnish eventually the whole set of 
solutions only “in general,” e.g. in the “regular case.” However, the clarification 
of “in general” or “regular case” will be far from being trivial, as may be seen 
already from the simplest case in which are all linear in the x’s. In fact, this 
is what we have tried to achieve in the form of Zero-Decomposition Theorems 
as described in Section 1 . 

However, in spite of the apparent absurdity of Descartes’ Program, the idea 
of Descartes is of highest significance to the development of mathematics. In this 
respect let us reproduce some paragraph again from the same classic of Polya: 

I Descartes’ project failed, but it was a great project and even in its failure it 
influenced science much more than a thousand and one little projects which hap- 
pened to succeed. Although Descartes’ scheme does not work in all cases, it does 
work in an inexhaustible variety of cases, among which there is an inexhaustible 
variety of important cases. ] 

In fact, now-a-days we see that there are really inexhaustible problems from 
mathematics, from sciences, from technologies, etc. which will ultimately be re- 
duced to polynomial equations-solving. A few of such particular problems have 
been indicated in Section 2 above. Now, among all kinds of systems of equa- 
tions to be solved, those of polynomial forms are clearly the simplest, and the 
most fundamental ones, but at the same time a problem entirely non-trivial at 
all. Among all kinds of problems in mathematics, the problem of polynomial 
equations-solving should, therefore, be considered as one of primary importance 
to be settled with highest priority. 
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Let us consider now the state of geometry problem-solving and geometry 
problem-solving in our ancient China. For this sake let us reproduce some para- 
graphs from a previous paper of the present author appeared in 
{{Proc. ATOM ’95)) (Singapore, 1996), pp. 66-72: 

I The geometry theorem-proving in ancient Greece is well-known. As for 
in ancient China, one considered usually geometry problem-solving rather than 
geometry theorem-proving. Most of geometry problems to be solved arose from 
practical requirements. We may cite in particular: 

(I) Problems arising from measurements, . . . 

(II) Problems about area and volume determination, . . . 

(III) Problems involving Gou-Gu Form (i.e. a right-angled triangle), . . . e.g. 
the determination of such forms with 3 sides in rational ratios. 

(IV) Problems arising from actual lifes, . . . 

A lot of such problems with their solutions appeared already in the two 
earliest classics fortunately preserved up to the present day, viz. {{Nine Chapters 
of Arithmetic)) completed in Ic B.C., and {{Zhou Bi Mathematics Manuel)), 
believed to be no later than 100 B.C. To these two classics we may add the more 
important classic {{Annotations to Nine Chapters » due to Liu Hui in 263 A.D. 

The results of the geometry problems solved were usually expressed in the 
form of Shui which may sometimes be interpreted as theorems in the form of 
explicit formulas. 

Instead of set of axioms the ancient Chinese formulated rather some simple 
and plausible principles and then proved the geometry theorems in the form of 
formulas or solved the geometry problems by logical reasoning. 

The simplest one of such principles is perhaps the Out-In Complementary 
Principle: When a planar (or a solid) figure be cut into pieces, moved to some- 
where else, and combined together, the area (or volume) of the figure will remain 
the same. 

Diverse applications including geometry formula-proving and geometry prob- 
lem-solving have been made by means of this seemingly trivial principle in an 
elegant and sometimes unbelievably striking manner. . . . We may cite also the 
proof of the general Gou-Gu formula (i.e. the Pythagorous Theorem) based on 
the above Principle. The proof appeared in {{Zhou Bi)) and was attributed to 
some scholar Shang Gao in year around 1122 B.C. 



A turning point about geometry problem-solving occurred in Song 
Dynasty (-1-960, -1-1274) and Yuan Dynasty (-1-1271, -1-1368). During that period 
the notions of Heaven Element, Earth Element, etc. (equivalent to the present 
day unknowns x,y, etc.), together with allied notions of polynomials and meth- 
ods of elimination, were introduced. This furnishes a systematic way of solving 
geometry problems in the following way. Designate the unknowns of the geom- 
etry problems to be determined by Heaven Element, Earth Element, etc. The 
geometry conditions about the geometry entities involved will be turned into 
some algebraic relations between them which are usually equivalent to polyno- 
mial equations in modern form. By eliminating the unknown elements in succes- 
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sion we get finally equations each time in one alone of the unknown elements. 
When the known data are in numerical values, what is usually the case, we may 
solve these equations in succession by means of methods of numerical solving 
developed in Song Dynasty. The results will give then the required solutions 
of the geometry problems in question. We remark in passing that polynomial 
equations-solving instead of geometry theorem-proving was the main theme of 
studies throughout the whole history of mathematics in ancient China. 

The method created in that period was called the Heaven Element Method. It 
was in essence a method of algebrization of geometry which permits to reduce in 
relatively trivial manner geometry problems to the algebraic problems of solving 
of systems of polynomial equations. Elimination method, which appeared already 
in {{Nine Chapters » for the solving of systems of linear equations (equivalent to 
present day Gaussian elimination) was also developed to the solving of general 
systems of polynomial equations in the period of Song- Yuan Dynasties. Numer- 
ous examples may be found in some still existent classics. Unfortunately the 
development of Chinese ancient mathematics stopped at this crucial moment 
during the period of Yuan and Ming Dynasties. 

The algebrization of geometry, created in ancient China and stopped in 
the 14th century, was revived in Europe in the 17th century. It was even system- 
atized to the form of analytic geometry owing to the creation of R. Descartes. 
Descartes emphasized in his 1637 classic on geometry the geometry problem- 
solving rather than geometry theorem-proving. This was just in the same spirit 
as in ancient China. The analytic geometry also permits to prove geometry the- 
orems by mere computations in contrast to the proving by purely geometrical 
reasoning of Euclid. The classic of Descartes showed also the way of reducing ge- 
ometry problems-solving to polynomial equations-solving, which was just what 
the Chinese scholars in Song and Yuan Dynasties had tried to do. 

In a posthumous work Descartes had founded some doctrine in saying that 
all problems can be reduced to the solving of problems in mathematics, then 
to problems in algebra, then to problems of polynomial equations-solving, and 
finally to problems of solving algebraic equations in single unknowns. In compar- 
ing with works of our ancestors, we see that our ancestors and Descartes were 
on the same lines of thought in the development of mathematics. 

The computational method of solving geometry problems (including proving 
geometry theorems) in reducing them to the solving of polynomial equations 
lacked however a general way of dealing with the associated system of equations 
usually in embarrassing confused form. 

In recent years the Chinese mathematicians, mainly in MMRC (Math. Mech. 
Res. Center, Institute of Systems Science, Chinese Academy of Sciences), have 
developed a general mechanization method for solving of arbitrary systems of 
polynomial equations which permits to be applied to geometry problem-solving, 
in particular geometry theorem-proving. Our achievements may be considered 
as a continuation of what our ancestors had founded as well as partial accom- 
plishments of Descartes doctrine. 

1 
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In conclusion, it would be interesting in comparing the way of Descartes in 
establishing equations for problem-solving with the way of our ancestors in intro- 
ducing unknowns called Heaven Element, etc. and then establishing equations 
for problem-solving as described above. In a word, it may be said that Chinese 
ancient mathematics in the main was developed along the way as indicated in 
Descartes’ Program, and conversely, Descartes’ Program may be considered as 
an overview of the way of developments of Chinese ancient mathematics. 
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Abstract. Many problems arising in real geometry can be formulated 
as first-order formulas. Thus quantifier elimination can be used to solve 
these problems. In this note, we discuss the applicability of implemented 
quantifier elimination algorithms for solving geometrical problems. In 
particular, we demonstrate how the tools of redlog can be applied to 
solve a real implicitization problem, namely the Enneper surface. 



1 Introduction 

Since Tarski introduced the first quantifier elimination algorithm for the real 
numbers in the 1930’s (cf. [21]), several other algorithms have been developed. 
Only some of them are implemented and widely available. Among them there 
are, for instance, the quantifier elimination by partial cylindrical algebraic de- 
composition implemented in the C-program QEPCAD, the quantifier elimination 
by real root counting implemented in the module QERRC within the Modula II 
computer algebra system MAS and the quantifier elimination by virtual substitu- 
tion implemented in the REDUCE-package redlog. Whereas QEPCAD and QERRC 
provide a complete quantifier elimination procedure, the quantifier elimination 
of REDLOG is restricted in the degree of the quantified variables. 

The naive approach to use one of the implemented quantifier elimination 
algorithms for solving a problem often fails in practice. Firstly, the programs 
can fail due to the limitations of computing time and memory. Secondly, the 
chosen quantifier elimination algorithm may not be able to handle the wanted 
class of problems. 

Nevertheless it is possible to solve non-trivial problems using the imple- 
mented quantifier elimination algorithms. In general this requires both a combi- 
nation of the available algorithms and a careful formulation of the problem as a 
first-order formula according to the chosen quantifier elimination method. 

We give solutions of problems cited in the literature and compare the re- 
sults of the different elimination procedures. In particular we discuss the real 
implicitization of the Enneper surface. 
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2 The General Framework 

We consider multivariate polynomials f(u,x) with rational coefficients, where 
u = (ui...,Um) and x = (xi, . . . , a;„). We call u parameters and we call x 
main variables. Atomic formulas are polynomial equations / = 0, polynomial 
inequalities / ^ 0, / < 0, / > 0, / < 0, and / 0. Quantifier-free formulas 

are built from atomic formulas by combining them with the boolean connec- 
tors “- 1 ,” “A,” and “V.” Existential formulas are of the form 3a;i . . . 3xntf{u, x), 
where tp is a, quantifier-free formula. Similarly, universal formulas are of the form 
\/xi . . . yxnfp{u, x). A prenex first-order formula has several alternating blocks of 
existential and universal quantifiers in front of a quantifier-free formula, the 
matrix of the prenex formula. 

The real quantifier elimination problem can be phrased as follows: Given a 
formula find a quantifier-free formula ip' such that both ip and p' are equivalent 
in the domain of the real numbers. A procedure computing such a p' from p is 
called a real quantifier elimination procedure. 

A background theory is a set of atomic formulas considered conjunctive. Two 
formulas p and p' are equivalent wrt. a background theory 0, if and only if 



where V denotes the universal closure. Notice, that the formula x^ — x = 0 
contained in a background theory, does not imply a multiplicative idempotency 
law, but only describes an equation for the variable x. 

A generic quantifier elimination procedure assigns to a formula p a quantifier- 
free formula p' and a background theory O' , such that p and p' are equivalent 
wrt. the background theory O' . The computed background theory contains only 
negated equations. Hence the set 



has measure zero. Notice, that each quantifier elimination procedure provides 
also a generic quantifier elimination assigning 0 to 0b However, the computation 
of an appropriate O' leads in many cases to simpler quantifier-free equivalents 
and results in a considerable speed-up of the running times. Moreover, in the 
framework of automatic proving in real geometry generic quantifier elimination 
allows us to find sufficient conditions automatically, cf. [13] for details. 

3 Quantifier Elimination Methods 





In this section we sketch the three most important implemented quantifier- 
elimination algorithms. For a more detailed overview over these 
algorithms, cf. [14]. 
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3.1 Quantifier Elimination by Partial Cylindrical Algebraic 
Decomposition 

Cylindrical algebraic decomposition (cad), cf. [6,1], is the oldest and most elab- 
orate implemented real quantifier elimination method. It has been developed 
by Collins and his students starting in 1974. During the last 10 years partic- 
ularly Hong made very significant theoretical contributions that improved the 
performance of the method dramatically, cf. [16,17], resulting in partial CAD, 
cf. [7]. Hong has implemented partial CAD in his program QEPCAD based on 
the computer algebra C-library SACLIB. QEPCAD is not officially published but 
available from Hong on request. Brown has contributed the latest very successful 
improvements to the algorithm, cf. [4] . Unfortunately, the improved version was 
not available when we made the computations for this article. 

We sketch the basic ideas of cad: Suppose we are given an input formula 

, . . . , UjYi ) = QiXi . . . Q„Xni>{ui , . . . , lljn 5 xi , . . . , Xti), € {3, V}. 

Let F be the set of all polynomials occurring in ip as left hand sides of atomic 
formulas. Call C C sign invariant for F, if every polynomial in F has 

constant sign on all points in C. Then ip{c) is either “true” or “false” for all 
ceC. 

Suppose we have a finite sequence 77i, . . . , Flm+n with the following proper- 
ties: 

1. Each Fli is a finite partition of into connected semi-algebraic cells. For 
^ ^ j each Fl^+j is labeled with Qj . 

2. 7Ti_i consists for 1 < i ^ m + n exactly of the projections of all cells in 
Fli along the coordinate of the i-th variable in (ui, . . . , Um, . . . ,Xn)- For 
each cell C in 7Ti_i we can determine the preimage S{C) C Fli under the 
projection. 

3. For each cell C in TJm we know a quantifier-free formula Sc(ui, . . . ,Um) 
describing this cell. 

4. Each cell C in Flm+n is sign invariant for F. Moreover for each cell C in 
Fm+m we are given a test point tc in such a form that we can determine 
the sign of /(tc) for each f G F and thus evaluate ip{tc)- 

A quantifier-free equivalent for ip is obtained as the disjunction of all 6c for 
which C in Flm is valid in the following recursively defined sense: 

1. For m i < m + n, we have that lli^i is labeled: 

(a) If 7Ti_|_i is labeled “3,” then C in Fli is valid if at least one C G S{C) is 
valid. 

(b) If 7Ti+i is labeled “V,” then C in Fli is valid if all C G S{C) are valid. 

2. A cell C in Flm+n is valid if ip{tc) is “true.” 

Partial cylindrical algebraic decomposition (pcad) is an improved version of 
CAD. It takes the boolean structure of the input formula into account. CAD and 
also PCAD is doubly exponential in the number of all variables. 
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3.2 Quantifier Elimination by Virtual Term Substitution 

The virtual substitution method dates back to a theoretical paper by Weispfen- 
ning in 1988, cf. [22]. During the last five years a lot of theoretical work has been 
done to improve the method, cf. [18,23,11]. The method is implemented within 
the REDUCE package redlog by the author and Sturm, cf. [10]. 

The applicability of the method in the form described here is restricted to 
formulas in which the quantified variables occur at most quadratically. Moreover 
quantifiers are eliminated one by one, and the elimination of one quantifier can 
increase the degree of other quantified variables. On the other hand there are 
various heuristic methods included for decreasing the degrees during elimination. 
One obvious example for such methods is polynomial factorization. 

For eliminating the quantifiers from an input formula 

ip{ui ) = QiXi . . . Q„Xni>{ui , . . . , U-fji I Xi , . . . , Xn}j Qz € {^5 

the elimination starts with the innermost quantifier regarding the other quanti- 
fied variables within tp as extra parameters. Universal quantifiers are handled by 
means of the equivalence Vxip < — > -^Bx-iip. We may thus restrict our attention 
to a formula of the form 

ip*(ui , . . . ,Ufc) = Bxip*(ui, . . ,,Uk,x), 

where the Um+i, ■ ■ ■ , Uk are actually Xi quantified from further outside. 

We fix real values ai, . . . , Ofc for the parameters ui, . . . , Uk- Then all poly- 
nomials occurring in ip* become linear or quadratic univariate polynomials in x 
with real coefficients. So the set 



M = { 6 e K 1 ip*{ai , . . . , Ofc, 6) } 

of all real values b of x satisfying ip* is a finite union of closed, open, and half- 
open intervals on the real line. The endpoints of these intervals are among ±(X) 
together with the real zeros of the linear and quadratic polynomials occurring 
in Ip*. 

Candidate terms ai, . . . , ar for the zeros can be computed uniformly in ui, 
. . . , Mfe by the solution formulas for linear and quadratic equations. For open and 
half-open intervals, we add expressions of the form a ± e, where a is a candidate 
solution for some left-hand side polynomial. The symbol e stands for a positive 
infinitesimal number. Together with the formal expressions oo and — oo all these 
candidate terms form an elimination set. This means M is non-empty if and only 
if the substitution of at least one element of the elimination set satisfies ip* . After 
substitution of formal expressions possibly involving square roots, e, or ±oo, 
we rewrite the substitution result in such a way that it does not contain any 
fractions nor one of these special symbols. This process of substituting a term and 
rewriting the formula is called virtual substitution. By disjunctively substituting 
all candidates into ip* we obtain a quantifier-free formula equivalent to 3xip* . 

For practical applications this method, of course, has to be refined by a 
careful selection of smaller elimination sets and by a combination with powerful 
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simplification techniques for quantifier-free formulas, cf. [11] for details. There 
is a variant of the virtual substitution method for generic quantifier elimination, 
cf. [13]. 

Quantifier elimination by virtual substitution is doubly exponential in the 
number of quantifier blocks but only singly exponential in the number of quan- 
tified variables. The number of parameters only plays a minor role in the com- 
plexity. 

3.3 Quantifier Elimination by Real Root Counting 

The basis for this quantifier elimination method is a theorem on real root count- 
ing found independently by Becker and Wormann, cf. [2,3], and Pedersen, Roy, 
and Szpirglas, cf. [19,20]. It is based on a result for counting real zeros of uni- 
variate polynomials found by Hermite, cf. [15]. 

Let I C K[xi, . . . , Xn] he a zero-dimensional ideal. For g £ R[a;i, . . . , Xn] 
consider the symmetric quadratic form Qg = (trace(mg{,^{,j. on 

the linear K-space S = K[a;i, . . . , Xn]/ 1 , where {bi , . . . , bj} C S is a basis, 
and the irih : S' — > S' are linear maps defined by mh{f 1) = (hf) 1. 

Let s he the signature of Qg, and denote by n-^. and n_ the number of real 
roots of I at which g is positive or negative, respectively. Then n_|_ — n_ = 
s. 

The use of a Grobner basis of the ideal / allows to obtain a basis of S, and to 
perform arithmetic there, cf. [5], thus obtaining the matrix Qg. 

This approach can be extended to obtain the exact number of roots under a 
side condition, and can be moreover extended to several side conditions: 

Let F, {gi, . . . ,gk} Q K[a;i, . . . ,a;„] be finite, and assume that L = Id{F) 
is zero-dimensional. Denote by N the number of real roots a € K” of F 
for which gi > 0 for 1 ^ z ^ fc. Define 

Then defining Qg as above we have 2^N = g^}) sig(Q 7 )- 

For real quantifier elimination, this root counting has to be further extended 
to multivariate polynomials with parametric coefficients in such a way that it 
will remain correct for every real specialization of the parameters including spe- 
cializations to zero. This task has been carried out by Weispfenning using com- 
prehensive Grobner bases, cf. [24]. 

The case of ideals that have a dimension greater than 0 is handled by re- 
cursive calls of the approach for zero-dimensional ideals: We construct from one 
polynomial system several polynomial systems in such a way, that each of the 
systems has either a smaller number of variables or a smaller dimension. 

The complete method has been implemented by the author within the pack- 
age QERRC of the computer algebra system MAS, cf. [9]. A variant for generic 
quantifier elimination is under development. 
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4 Simplification Methods 

Simplification of formulas means to compute to a given formula an equivalent 
one, which is simpler. The simplification of the input formulas and intermediate 
results are very important for a successful application of quantifier elimination by 
virtual substitution and for quantifier elimination by real root counting. QEPCAD 
does not depend heavily on simplification of formulas, because formulas are 
used only for input and output. For a detailed description of the simplification 
algorithms sketched in this section and a discussion of the notion of simple 
formulas, cf. [11]. 

4.1 The Standard Simplifier 

The standard simplifier is a fast, though sophisticated, simplifier for quantifier- 
free formulas. It was developed for the implementation of the quantifier elimina- 
tion by virtual term substitution in the redlog package. The standard simplifier 
was designed to be called very often, for instance, after each elimination step. 
Beside the simple methods to remove boolean constants and to keep only one of 
a set of identical subformulas, it implements three main strategies: 

Firstly, it simplifies the atomic formulas. All right hand side of atomic formu- 
las are normalized to zero. The left hand side polynomials are normalized to be 
primitive over Z in such a way that the highest coefficient wrt. a fixed term order 
is positive. Furthermore, we drop irrelevant factors of the polynomials. Trivial 
square sums are detected to be greater than zero or not less than zero, respec- 
tively. Optionally, we explode atomic formulas by decomposing the polynomial 
additively or multiplicatively. 

Secondly, smart simplification is applied to conjunctions and disjunctions of 
atomic formulas: Each pair of atomic formulas involving identical left-hand sides, 
is replaced by “true,” “false,” or by one atomic formula using an appropriate 
relation. Similarly, this method can be applied to some pairs of atomic formulas 
which differs only in the absolute summand of the left-hand side polynomial. 

Thirdly, the techniques used for the smart simplification of flat formulas are 
applied to nested formulas. This is done by constructing an implicit background 
theory for each boolean level of the formula. 

The standard simplifier offers the option to simplify a formula wrt. a back- 
ground theory. The output formula is then equivalent to the input formula 
wrt. the given background theory. 

4.2 Simplification of Boolean Normal Forms 

For the simplification of boolean normal forms we use two further techniques 
additionally to the techniques used in the standard simplifier. 

The generalized subsumption allows us to drop conjunctions of a CNF: Let ti 
terms and p^,pi S Then 



{ti Pi 0 A ... A tn PnO) \/ {ti p[0 A ... A tn p'„0 A ■■■) 
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can be simplified to (ii pi 0 A . . . A 0), if U p[ 0 — > U pi 0. 

The generalized cut combines two conjunctions combined disjunctively into 
one conjunction: if (ti piQM ti p' 0) < — > ti Oi 0 then the disjunction 



(ti Pi 0 A t2 P2 0 A . . . A 0) V (fi Pi 0 A t2 P2 0 A . . . A p„ 0) 

can be simplified to (ti cti 0 A t 2 P 2 0 A ... A tn Pn 0). In our implemen- 
tation, not all possible applications of the generalized cut and the generalized 
subsumption are performed. We only apply the simplifications in cases, where the 
respective implication can be decided independently of the terms ti . Analogous 
simplification rules hold for conjunctions of disjunctions. 

4.3 The Grdbner Simplifier 

The Grohner simplifier is an advanced method for the simplification of boolean 
normal forms. The main technique used for this method is the computation of a 
Grobner basis for deciding the ideal membership test, cf. [5]. Let T" be a set of 
polynomials and G a Grobner basis of F. Then the Grobner simplifier replaces 

/\ ./ = 0 A /\ /ipftO by f\ f = 0 A f\ ma{h) phO. 

feF heH feG hgh 

Using a radical membership test, we can optionally replace the input formula by 
false, if h £ Rad(F) and ph £ >}• 

There is, of course, a variant which simplifies disjunctions of atomic for- 
mulas. Gonjunctive and disjunctive normal forms are simplified essentially by 
applying the Grobner simplifier to each of the constituents. Additionally there 
are strategies to relate information contained in different constituents. For the 
simplification of an arbitrary formula, we compute a CNF or a dnf first. We 
have implemented a variant of the Grobner simplifier which automatically de- 
cides, whether to compute a CNF or a dnf. This decision is based on a heuristic 
for estimating which of the normal forms is larger. Like the standard simplifier 
the Grobner simplifier can simplify a formula wrt. a background theory. 

Although the Grobner simplifier operates on a boolean normal form, we con- 
sider it as a general simplification method. It turned out that in many cases the 
result of the Grobner simplification is simpler than the input formula. 

We illustrate the technique by means of a very simple example: Gonsider the 
input formula xy + ly^0\/yz + ly^0\/x — z = 0, which can be rewritten as 

xy+\ = f) A yz + l = f) — > x — z = f). 

Reducing the conclusion modulo the Grobner basis {x— z^y z+\\ of the premises, 
we obtain the equivalent formula xy + l = Q A yz + l = Q — *■ 0 = 0, which can 
in turn be easily simplified to “true.” 
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4.4 The Tableau Method 

Although the standard simplifier combines information located on different lev- 
els, it preserves the basic boolean structure of the formula. The Grobner simpli- 
fier computes a boolean normal from and thus it changes the boolean structure 
completely. The tableau method, in contrast, provides a technique for changing 
the boolean structure of a formula slightly by constructing case distinctions. 
There are three variants of tableau simplifiers: 

Given a formula ip and a term t the tableau simplifier constructs the following 
case distinction: 



(t = 0 A V (t > 0 A V (t < 0 A <p). 

This formula is then simplified with the standard simplifier. Even though the 
constructed case distinction is about three times larger than the original formula, 
in some cases the simplified equivalent is much smaller than the original formula. 

The iterative tableau simplifier chooses automatically an appropriate term t 
by trying all terms contained in ip. The result is then either the original formula 
or the best result obtained by the tableau simplifier, depending on which one is 
simpler. The automatic tableau simplifier applies the iterative tableau as long as 
the output is simpler than the input. 



5 REDLOG 

REDLOG is a REDUCE package implementing a computer logic system, cf. [10]. It 
provides algorithms to deal with first-order logic formulas over various languages 
and theories. Beside the theory of real closed fields there are algorithms for the 
theory of discretely valued fields [12] and algebraically closed fields, redlog 
provides an implementation of the quantifier elimination by virtual substitution 
including variants for generic quantifier elimination, extended quantifier elimi- 
nation, and linear optimization using quantifier elimination techniques. There 
are also interfaces to QEPCAD and QERRC available such that these packages can 
be called from REDLOG and the results are available to be further processed. 
All simplification algorithms discussed in this note are available in REDLOG. Be- 
sides these important algorithms redlog provides many tools for normal form 
computations, constructing, analyzing, and decomposing formulas. The redlog 
source code and documentation are freely available on the www.^ 

6 Examples 

By means of examples we discuss in this section the applicability of quanti- 
fier elimination to some problems taken from various scientific research areas. 
We compare the timings of the three considered implementations. Furthermore 

http : //www.fmi .uni-passau.de/~redlog/ 
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we will give some hints, how to apply quantifier elimination, even if the naive 
approach fails. 

In this section we allow ourselves to use not only prenex formulas but also 
general first-order formulas, i.e. boolean combinations of prenex formulas. Fur- 
thermore, we use the implication and the equivalence inside formulas. 
Notice, that these general formulas can be easily transformed in prenex formu- 
las containing only the boolean connectors A and V. 

All example computations mentioned in this section have been performed on 
a SUN Ultra 1 Model 140 workstation using 32 MB of memory. 

6.1 Real Implicitization 

For n < m let f : K” ^ K™ be a rational map with the component func- 
tions Pi/q, where pi, q S . . . , Xn] for 1 < f < m. The image /(M") of / is a 
definable and hence semi-algebraic subset of K™ described by the formula 

m 

3xi . . ,3xn{q{xi ,. . . ,a;„) 0 A f\pi{xi, . . . ,Xn) = u^q(xi , . . . ,a;„)). 

i=l 

Our aim is to obtain a quantifier- free description of /(M”) in the 
variables ui, . . . , Um, preferably a single equation, which would provide an im- 
plicit definition of /. 

Example 1. Descartes’ folium c? : M — *■ is given by the component functions 

3xi/(l -I- xf) and 3xf/(l + xf) for ui and U 2 , cf. [8]. For obtaining an implicit 
form we apply quantifier elimination to 

3xi(l -I- 0 A 3a;i = ui(l + xf) A 3xf = U 2 (l + xf)). 

QEPCAD obtains after 1 s the result uf — 3uiU2 + uf = 0. QERRC obtains 
after 1.6 s an elimination result with 7 atomic formulas. This can be automati- 
cally simplified to the QEPCAD result using the Grobner simplifier. REDLOG fails 
on this example due to a violation of the degree restrictions. After simplifying 
the matrix of the formula with the Grobner simplifier for one’s we can eliminate 
the quantifier with REDLOG. The output formula computed in 0.3 s contains 21 
atomic formulas. 

Example 2. The Whitney Umbrella is given by 

{XI,X2) 1-^ (xiX2,X2,xl). 

QEPCAD produces in 1 s the result U3 ^ 0 A —uf + U 2 U^ = 0. Quantifier elimina- 
tion by virtual substitution (qevts) produces in 0.01 s the much longer result 

M3 ^ 0 A {uf — ufus = 0 A U 1 U 2 ^ 0 V uf — ufu'i = 0 A U\U 2 ^ 0). 

However, including a Grobner simplification we yield in 0.03 s the same result as 
produced by QEPCAD. Using QERRC we get the following quantifier-free equiva- 
lent in 0.8 s: 

M2 y^ 0 A —uf + M2M3 = 0VMi = 0AM2 = 0AM3-|-ly^0AM3^0. 
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6.2 Automatic Theorem Proving in Geometry 

Example 3 ( Steiner- Lehmus theorem, variant). Assume that ABC is a triangle 
such that AB > AC. Then the angle bisector from B to AC is longer than the 
angle bisector from C to AB (i.e., the longer bisector goes to the shorter side). 

In its original form, the Steiner-Lehmus theorem states that any triangle with 
two equal internal bisectors is isosceles. Its contrapositive follows immediately 
from the variant above. 

We put A = (—1,0), B = (1,0), and C = (xo,yo) with j/o > 0. By M = (0,6) 
we denote the center and by c the radius of the circumcircle. 

The bisectors are constructed using the geometrical theorem proved as Ex- 
ample 4: Let V = (0, 6 — c) be the point below the x-axis on the circumcircle 
having equal distance to A and B. Then the angle bisector from C to AB is 
obtained as CX, where X = (x,0) is the intersection of CV and AB. The angle 
bisector from B to AC is obtained analogously: Let W = (xi,yi) be the point 
on the circumcircle with equal distance to A and C lying “west” of the line AC. 
Let Y = (x 2 ,y 2 ) be the intersection of BW and AC. Then the angle bisector 
is BY. 

Our algebraic translation obtained this way reads as follows: 

V 6 VcVxVxiVyiVx 2 Vy 2 (yi(a;o + 1) > xiyo Ayo>0Ac>0 A 

= 1 -\-b^ A = Xq { yo — b)^ A x(yo + (c — 6)) = xq(c — 6) A 
xf + (yi - bf = A (xi -b 1)^ + yl = (a;i - a^o)^ + (yi - yof a 
( xi - l)y 2 = yi{X 2 - 1) A (xo + l)j /2 = yo{X 2 + 1) A 
4 > (xo -b 1)^ -b ?/o — ' (a; - a:o)^ + ~ 1)^ + vl) ■ 

Using QEVTS we obtain after 74 s an elimination result (p* containing 243 atomic 
formulas together with the subsidiary conditions 

1?* = Xq -b 2xo + yl + l^Q A 
Xg - 2xg -i-yl-3^0AxQ-\-1^0Axo^0Ayo^0. 

QEPCAD proves within 250 s that i3* — *■ tp*, while QEVTS, QERRC fail doing so. 

Example ). Let M be the center of the circumcircle of a triangle ABC. Then 
YACB = YAMB/2 (see Figure 1). 

Choosing coordinates A = (— a, 0), B = (a, 0), C = (xg,?/o), and M = (0,6) 
and encoding angles into tangents, an algebraic translation of this problem reads 
as follows: 

VxViiV 62 ViV 6 (c^ = -b 6^ A = Xg -b {yo — 6)^ A 
j/gti = a -b Xg A yot 2 = a — Xg A (1 — tit 2 )t = ti -\-t 2 — > bt = a). 

Both QEPCAD and QERRC fail on this input. QEVTS together with the Grobner 
simplifier yields after 0.06 s the quantifier-free equivalent ay^OVxgyfOVyoyfO 
containing non-degeneracy conditions for the triangles. 
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Fig. 1. The angle at circumference is half the angle at center (Example 4). 



The generic variants of QEVTS and QERRC produce “true” as a quantifier-free 
equivalent wrt. the background theory {y yf 0}. This computation takes 0.02 s 
with QEVTS and 2 s with QERRC. 

6.3 The Enneper Surface 

The Enneper surface is defined parametrically by 

X = 3u + 3uv^ — y = 3v + 3u^v — z = 3u^ — 3v^, 
cf. [8]; in other words as the image of the function 
/ : with /(u, v) = (3u + 3uv^ — 3v + 3u^v — v^, 3u^ — 3v^). 

The smallest variety V containing the Enneper surface is given by the polynomial 

p{x, y, z) = 19683a;® - 59049 a;^ 2/2 i0935a;^z® -k 118098a;S2 - 59049a;^z -k 

59049x^2/^ -k 56862x2y^z® -k 118098x^^22 -k 1296x2^® -k 
34992x2^® -k 174960 x2z^ - 314928x2^® - 19683y® -k 10935yV - 
1180982/^2 _ 590492 /^z - I296y‘^z^ + 3i992y^z^ - 17i960y^z^ - 
3 1 49 282/^2® - 642® -k 103682^ - 419904z®. 

Using Grobner bases techniques it is easy to proof that over the complex numbers 
the image of / is identical to the complete complex variety V. The real Enneper 
surface is given by the restriction / • The question is whether the real Enneper 

surface is identical to the real variety of p. 

This problem can be easily stated as a real quantifier elimination problem. 
A first-order description of the real image of / is given by 

(p = 3u3v (x = 3u -k 3uv^ — A y = 3v + 3u^v — x® A 2 = 3^2 — 3r;2) . 
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Using this formula the complete problem can be stated as 

y, z) ^ p{x, y, z) = 0). 

It turns out, that none of the quantifier elimination algorithms considered in 
this paper can eliminate all the quantifiers of the above formula. Actually, none 
of the implementations is able to find a quantifier-free equivalent to ip. In the 
next paragraphs we will describe how to tackle the problem with the quantifier 
elimination algorithms and the tools available in the redlog system. 

In a first step we compute a quantifier-free equivalent to ip. With the im- 
plementation of the virtual substitution method we can eliminate one of the 
existential quantifiers of (p. The solutions of eliminating u or eliminating v are 
up to a simple substitution identical. We start with the elimination of v and 
obtain in 0.2 s the result 

3u(v 4 ^0A(r;i=0Ar;2 = 0Ar)3^0V?;i = 0A?;2 = 0Ar;3^ 0)) 
where 

vi := 108u® -b 324 m^ - 9uh^ - 54u^z + 243u^ - 27y‘^ - z^ - - 81z, 

V 2 ■= 2u^ — uz + 3u — X, 

V 3 := 6u^y + yz + 9y, 

V 4 := 3u^ — z. 

Applying the generalized cut to both constituents of the disjunction, we obtain 
the formula: 

3 u ( v 4 ^ 0 a ui = 0 a 'C2 = O) . 

REDLOG produces this formula by computing a disjunctive normal form. After 
this simplification only the package QERRG can eliminate the remaining quanti- 
fier, but the obtained elimination result p' is very large. Though it has only 81 
atomic formulas a textual representation contains approximately 500 000 char- 
acters. 

However, we have found a quantifier-free description of the image of the 
function describing the Enneper surface. One of the atomic formulas contained 
in the result is the equation p = 0. But this fact does not imply any direction of 
the equivalence we want to prove. 

The obtained result is in fact to large, to eliminate the universal quantifiers 
from 

Vxyyyz{p{x, y, z) 4A p(x, y, z) = O) 

with one of our quantifier elimination procedures. Actually, we are not able to 
eliminate at least one universal quantifier. Thus we use redlog to simplify p' . 

An analysis of the formula yields that the result formula is a disjunction of 
nine subformulas. Five of these nine are simply conjunctions of atomic formulas 
and the remaining four subformulas are essentially conjunctive normal forms. 

For a simplification of the formula we thus simplify each constituent of 
the top-level disjunctions with the Grobner simplifier for conjunctive normal 



26 



Andreas Dolzmann 



forms. In a first step, we take the Grobner simplifier without a factorization 
of atomic formulas. In a second step we use the Grobner simplifier with the 
automatic factorization option twice for each constituent of the top-level dis- 
junction. After these simplification we obtain a formula with 50 atomic formulas 
and about 30000 characters. All these simplifications takes 434 s. 

The automatic tableau method can simplify some of the resulting conjunctive 
normal forms in about 1 s. After these simplifications we get a disjunction of 
eight conjunctive normal forms. From these eight conjunctive normal forms are 
six pure conjunctions of atomic formulas. The two other ones are conjunctions 
containing atomic formulas and only one disjunction of two atomic formulas. 
Using the equivalents 

(a < 0 V /3 = 0) < — > (a • /3^ ^ 0) and (a = 0 V /3 = 0) < — > (a • /3 = 0) 

we can simplify the formula to a disjunctive normal form ij) with constituents ijji. 

Let 0i be the set of atomic formulas of tpi. Using the Grobner simplifier we 
can simplify p = 0 wrt. the theory 0i to “true.” This means that tpi — > p = 0 
and hence Tp — > p = 0. 

Using a tool of redlog, which counts the frequencies of all included atomic 
formulas we analyze our result. The most frequently occurring atomic formulas 
in Ip are 

a; = 0, y = 0, z — 3 ^ 0, and z ^ 0 

each of them with more than three occurrences. 

This observation suggests, that we should study a special case of the impli- 
cation. For arbitrary formulas !F, F and formulas Oi, . . . , with A • • • A 
contradictive the following equivalence holds: 

{F F) < — > (oi A • • • A an) V {F ^ F) 

< — > (^cii A •F) T) A . . . A {^an A F) ^ F) 

< — > (^ai (<F ^ T) A . . . A ^ {F ^ ^))- 

Using a; yf 0, y yf 0, z yf 0, 2 7 ^ 3, and (a; = 0Vy = 0Vz = 0Vz = 3)for 
the ai we can split our implication in a conjunction of five implications. Four of 
them include simple equations for one of the variables. 

For proving the cases including an equation we substitute the values 0 and 3 
for the respective variable in the formula p = 0 '0 then we eliminate the 

universal closure of the substitution result. The timings for this elimination are 
summarized in Table 1. The elimination result is in all cases “true.” 

After we have checked the special cases, we can exclude these cases by simpli- 
fying the formula tp with respect to the theory {a;yf0,yyf0,zyf0,zyf3}. This 
yields a disjunction of two conjunctions. One of the remaining constituents is 
contradictive as shown by the elimination of the existential closure with QEVTS 
in 0.8 s. Neither QEPCAD nor QERRC are able to prove this fact. The remaining 
constituent is a conjunction /3i A • • • A 04 of atomic formulas, where 0i is the 
equation p = 0. Instead of proving the complete implication, which is not pos- 
sible with our quantifier elimination procedures, we prove p = 0 0i for each 
atomic formula (3i in the conjunction. The first implication holds trivially. 
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Table 1. Elimination times for the special cases. 

X = 0 y = 0z = 0z = 3 
QEVTS failed 0.7 s 0.1 s failed 
QEPCAD 1.0 s 23.0 s 1.0 s 1.0 s 



While trying to eliminate one of the quantifiers redlog applies a heuristic 
to decrease the degree of the variables x and y. Namely, it replaces each oc- 
currence of x^, and by x and y respectively, adding the additional premise 
a; > 0 A y > 0. Finally the quantifier elimination fails. However, after the degree 
reduction QEPCAD is able to eliminate all universal quantifier. For the elimi- 
nation it is necessary to give the quantifiers in the order VzVyVa;. Altogether 
the three eliminations take 34 s. All results are “true” and thus we have finally 
proven the equivalence. 

7 Conclusions 

After approximately 50 years of development, quantifier algorithms can nowaday 
be used to solve geometrical problems. Even if a first approach fails, the several 
quantifier elimination methods can be applied in interaction with the user to 
appropriate subproblems, finally solving the problem. Our results show, that 
among the three considered algorithms there is no best algorithm, solving all 
problems. Furthermore they show that not only quantifier elimination algorithms 
are necessary but even powerful simplification algorithms and a wide spectrum 
of tools for analyzing and decomposing formulas. Important for the user is a 
common interface to all algorithms, like redlog provides one. 
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Abstract. Automated discovering and proving for geometric inequali- 
ties have been considered a difficult topic in the area of automated rea- 
soning for many years. Some well-known algorithms are complete theo- 
retically but inefficient in practice, and cannot verify non-trivial propo- 
sitions in batches. In this paper, we present an efficient algorithm to 
discover and prove a class of inequality-type theorems automatically by 
combining discriminant sequence for polynomials with Wu’s elimination 
and a partial cylindrical algebraic decomposition. Also this algorithm is 
applied to the classification of the real physical solutions of geometric 
constraint problems. Many geometric inequalities have been discovered 
by our program, DISCOVERER, which implements the algorithm in 
Maple. 



1 Introduction 

In the last 20 years, the efficiency of computer algorithms for automated rea- 
soning in both algebraic and logic approaches has greatly increased. One has 
reason to believe that computer will play a much more important role in reason- 
ing sciences in the coming century. People will be able to prove theorems class 
by class instead of one by one. Since Tarski’s well-known work [21], A Decision 
Method for Elementary Algebra and Geometry, published in the early 1950’s, 
the algebraic approaches have made remarkable progress in automated theorem 
proving. Tarski’s decision algorithm, which could not be used to verify any non- 
trivial algebraic or geometric propositions in practice because of its very high 
computational complexity, has only got theoretical significance. Some substan- 
tial progress was made by Seidenberg [20], Collins [12] and others afterwards, 
but it was still far away from mechanically proving non-trivial theorems batch 
by batch, and even class by class. The situation did not change until Wu [25,26] 
proposed in 1978 a new decision procedure for proving geometry theorems of 
“equality type,” i.e. the hypotheses and conclusions of the statements consist 

* This work is supported in part by CAS and CNRS under a cooperative project 
between CICA and LEIBNIZ, and in part by the National “973” Project of China. 
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of polynomial equations only. This is a very efficient method for mechanically 
proving elementary geometry theorems (of equality type) . Chou [8] has success- 
fully implemented Wu’s method for 512 examples which include almost all the 
well-known or historically interesting theorems in elementary geometry, and it 
was reported that for most of the examples the CPU time spent was only few 
seconds each, or less than 1 second! 

The success of Wu’s method has inspired in the world the advances of the al- 
gebraic approach [18,22,23] to automated theorem proving. In the past 20 years, 
some efficient provers have been developed based on different principles such 
as Grobner Bases [5,6] and Parallel Numerical Method [32]. Especially, Zhang 
and his colleagues [11,29] gave algorithms and programs for automatically pro- 
ducing readable proofs of geometry theorems. The achievement has made the 
studies in automated proving enter a new stage that the proofs created by ma- 
chines can compare with those by human being, while the decision problem was 
playing a leading role before. It has also important applications to mathematics 
mechanization and computer aided instruction (CAI). 

Those methods mentioned above are mainly valid to equality-type theorem 
proving. As for automated inequality discovering and proving, the progress has 
been slow for many years. Chou and others [9,10] made helpful approaches in 
this aspect by combining Wu’s method with CAD (Cylindrical Algebraic De- 
composition) algorithm or others. Recently, a so-called “dimension-decreasing” 
algorithm has been introduced by Yang [27]. Based on this algorithm, a pro- 
gram called “BOTTEMA” was implemented on a PC. More than 1000 algebraic 
and geometric inequalities including hundreds open problems have been verified 
in this way. The total CPU time spent for proving 120 basic inequalities from 
Bottema’s monograph “Geometric Inequalities,” on a Pentium/200, was 20-odd 
seconds only. To our knowledge, this is the first practical prover which can verify 
non-trivial inequality-type propositions in batches. As one of the applications, 
this makes it practical to solve a global-optimization problem by means of veri- 
fying finitely many inequalities. 

The reason for automated inequality discovering and proving to be a difficult 
topic is that the concerning algorithms depend fundamentally upon real algebra 
and real geometry, and some well-known algorithms are complete theoretically 
but inefficient in practice. In [30,34,31], Yang and others introduced a powerful 
tool, complete discrimination system (CDS) of polynomials, for inequality rea- 
soning. By means of CDS, together with Wu’s elimination and a partial CAD 
algorithm, we present an efficient algorithm to discover and prove a class of 
inequality-type theorems automatically. A program called “DISCOVERER” was 
implemented in Maple that is able to discover new inequalities automatically, 
without requiring us to put forward any conjectures beforehand. For example, by 
means of this program, we have re-discovered 37 inequalities in the first chapter 
of the famous monograph [19], “Recent Advances in Geometric Inequalities,” 
and found three mistakes there. 
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2 The Problem 

In general, the problem we study here is: 

Give the necessary and sufficient condition which the parameters u must 
satisfy for the following system TS to have exactly n distinct real solution(s) (or 
simply, to have real solution(s)): 

{ .fi{u,xi) = 0, 
f2{u,Xi,X2) = 0 , 

(1) 

fs(u, Xi,X2, ...,Xs)=0, 

gi{u,X) > 0,g2{u,X) > 0, . . ,,gt{u,X) > 0 

where u = • • ■ ,Ud), X = (xi,X 2 , ■ ■ -,Xs), 

fi € Z{u)[xl,...,x^], l<i<s, 

gj G Z{u)[xi,...,Xs], l<j<t, 

and {/i, fy, • ■ ■ , /s} is a “normal ascending chain” [33,34] which is “simpli- 
cial” [33,34] w.r.t. each gj (1 < j < t) (also see Section 3 of the present paper 
for the definitions of these two concepts). Some of the inequalities in (1) may be 
strict ones. 

Before we go further, we illustrate this class of problems by an example: 
Give the necessary and sufficient condition for the existence of a triangle with 
elements s, r, R, where s, r, R are the half perimeter, inradius and circumradius, 
respectively. 

Let a, b, c be the lengths of sides and, without loss of generality, let s = 1; we 
have 

{ Pi = a + 5+ c— 2 = 0, 

P 2 = - (1 - a)(l - b){l - c) = 0, 

P 3 = 4rR— abc = 0, 

0<a<l,0<6<l,0<c<l,0<r, 0<i?. 

It is easy to see that PS' is equivalent to the following system: 

{ /i (r, i?, a) = — 2a^ + (r^ + 4 ri? + l)a — 4 ri? = 0, 

f 2 {r,R,a,b) = ab"^ + a(a - 2)6 + 4 ri? = 0, 

/a(r, i?, a, 6, c) = a + 6 + c — 2 = 0, 
0<a<l,0<6<l,0<c<l,0<r, 0<i?. 

where a, 6, c are the variables and r, R the parameters. What we need to do 
is to find the condition satisfied by r, R, under which TS' has real solution(s). 
Obviously, this is a problem of the class defined above. 

By the algorithm presented in this paper, we can easily find the condition 

+ 2 - 4 _ 20 rRs^ + 12 r^R + 48 + 64 rR^ < 0, 

which is called the “Fundamental Inequality” [19] for triangles. 
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3 Concepts and Notations 

In this section, we review some concepts and notations from [33,34] and [30,7,17]. 
Given a polynomial g and a triangular set {/i, / 2 , . . . , /s}, let 

Ts=g, Ts-j = resultant(rs_j+i,/s_i+i,a;s_j+i), i = l,2,...,s, 
qs = g, = prem(9s_i+i,/s_j+i,a;s-i+i), * = l,2,...,s; 

and res(g, fs, . ■ ■ , fi) and prem(g, fs, . ■ . , fi) denote tq and go, respectively. 

Definition 3.1 (Normal Ascending Chain [33,34]). Given a triangular set 
{/i, / 2 , ■ ■ • , fs}, by /i (i = 1, 2, . . . , s) denote the leading coefficient of fi in Xi. 
The triangular set {/i, / 2 , ■ ■ • , /«} is called a normal ascending chain if 

/i yf 0, res(/i,/i_i,...,/i) yf 0, i = 2, . . . , s. 

Definition 3.2 (Simplicial [33,34]). A normal ascending chain {/i, / 2 , • • ■ , /«} 
is simplicial with respect to a polynomial g if either prem(g, /g, . . . , /i) = 0 or 
res(g,/s,...,/i) yf 0. 

Definition 3.3 (Discrimination Matrix [30]). Given a polynomial with general 
symbolic coefficients 



f{x) = agx^ + aix^ ^ + h a„, 

the following 2n x 2n matrix in terms of the coefficients 



ao 


Ol 


02 


■ ■ ‘ 


0 


nao 


(n — l)ai 


• • ■ Ctrl — 1 




flQ 


Ol 


^n— 1 




0 


nao 


• • ■ 2ctri— 2 ^n — 1 



OQ 


ai 


02 


0 


rtao 


(n- 1) 



is called the discrimination matrix of f{x), and denoted by Discr(/). By dk 
or dk{f) denote the determinant of the submatrix of Discr(/), formed by the 
first k rows and the first k columns for fc = 1, 2, . . . , 2n. 

Definition 3.4 (Discriminant Sequence [30]). Let Dq = 1 and Dk = d 2 k, 
k = 1, . . . , n. We call the n + 1-tuple 



Do, Di, D 2 , . . . , Dn 

the discriminant sequence of /(x), and denote it by DiscrList(/). 

The last term is also called the discriminant of / with respect to x, and 
denoted by Discrim(/, x). It should be noted that the definition of Discrim(/, x) 
here is little different from the others which are DnjaQ- 
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Definition 3.5 (Sign List [30]). We call the list 

[sign(Ao),sign(^i),sign(^ 2 ), • ■ . ,sign(^„)j 

the sign list of a given sequence Aq, Ai, A 2 , . . . , An, where 

( 1, a; > 0, 

sign(a;) = < 0, a; = 0, 

[ —1, a; < 0. 

Definition 3.6 (Revised Sign List [30]). Given a sign list [si, S 2 , . . . , s„], we 
construct a new list 

[tl , ^2 5 ■ ■ ■ ? tji\ 

(which is called the revised sign list) as follows: 

— If [si, Si+i, . . . , Si+j] is a section of the given list, where 

Si ^ 0, Si-|_i — * * * — Si-\.j — ± — 0, Si-^j ^ 0, 

then, we replace the subsection 

[sj+l J J Sj+J — 1 ] 

with the first j — 1 terms of ^ 

[ . . . ]. 

— Otherwise, let tk = Sk, i.e. no change for other terms. 

Example 3.1. The revision of the sign-list 

[1,-1, 0,0, 0,0, 0,1, 0,0, -1,-1, 1, 0,0,0] 

is [1,-1, 1,1, -1,-1, 1,1, -1,-1, -1,-1, 1,0, 0,0]. 

Theorem 3.1 ([30]). Given a polynomial f{x) with real coefficients 

/(x) = aox" -I- -I- h a„, 

if the number of the sign changes of the revised sign list of 

Do,Diif),D2if),...,Dnif) 

is v, then the number of the pairs of distinct conjugate imaginary roots of /(x) 
equals v. Furthermore, if the number of non-vanishing members of the revised 
sign list is I, then the number of the distinct real roots of /(x) equals I — 1 — 2v. 



^ that is, let U+r = (—1)^ ^ • Si, r = 1,2, . . . , j — 1. 



Automated Discovering and Proving for Geometric Inequalities 



35 



Definition 3.7 (Generalized Discrimination Matrix). Given two poly- 
nomials g{x) and f{x) where 

f{x) = qqx^ + aix^~^ + h a„, 



let ^ 

r(a;) = rem(/'5, /, a;) = boX^~^ + H h bn-i- 

The following 2n x 2n matrix 



U-n 
^n— 1 
^n— 1 

bn-2 bn-1 



ao cii 02 ■ ■ ■ a„ 

0 6 q bi ■ ■ ■ bn-i _ 

is called the generalized discrimination matrix of f{x) with respect to g{x), and 
denoted by Discr(/, g). 

Definition 3.8 (Generalized Discriminant Sequence [7,17]). Given two polyno- 
mials f{x) and g{x), let the notations be as above, and Dq = 1. Denote by 

Di(/,g), D2(/,5), ...,D„(/,g) 

the even order principal minors of Discr(/, g). We call 

Do, Di(/, 5),D2(/,5), ...,D„(/,g) 

the generalized discriminant sequence of f{x) with respect to g{x), and denote 
it by GDL(/, g). Glearly, GDL(/, 1) = DiscrList(/). 

Theorem 3.2 ([7,17]). Given two polynomials f{x) and g{x), if the number of 
the sign changes of the revised sign list of GDL(/, g) is iz, and the number of 
non- vanishing members of the revised sign list is I, then 



ao fli 02 
0 bo bi 
ao «! 
0 bo 



l-l-2u = c{f, g+) - c(/, g-), 

where ^ 

c{f,9+) = card({a; S i?|/(a;) = 0,g(a;) > 0}), 

c(/,5-) = card({a; S R\f(x) = 0,g{x) < 0}). 

^ By rem(a(o;), b(x), a;) denote the remainder of a{x) divided by b{x). 
® By card(-) represent the cardinal number of a set. 
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4 A Theoretical Algorithm 

In this section, we give an algorithm which has theoretical significance. The 
practical algorithm is given in the next section. 

Let 

Ps = {Pi\l < j < n} 

be a nonempty, finite set of polynomials. We define 

mset(ps) = {1} U {pi^Pi^ ■ ■ ■ PiA^ < k < n,l < h < i2 < ■ ■ ■ < ik < n} ■ 

For example, if ps = {pi, p 2 , Ps}, then 

mset(ps) = {1, pi, P 2 , P3, PlP 2 , PlP3, P2P3, PlP2Pz\- 
Given system TS as (1), we define 



Ps+i = 

U,= y GDL(/„9), 

gGmset(Pi+i) 

Pj = {h{u,xi, . . . ,Xi-i)\h e C/J, for i = s, s - 1, . . . , 2; 
Pi{9i,92,---,gt) = {h{u)\hG Ui}, 

where Ui means the set consisting of all the polynomials in each GDL(/j, q) with q 
belonging to mset(Pi+i). Analogously, we can define Pi(pi, . . . , gj) (1 < j < t). 
Then we have 

Theorem 4.1. The necessary and sufficient condition for system TS to have a 
given number of distinct real solution (s) can be expressed in terms of the signs 
of the polynomials in Pi(pi, p 2 , ■ • ■ , 9t)- 

Proof. First of all, we regard fs and every gi as polynomials in Xs- By Theo- 
rems 3.1 and 3.2 we know that under constraints Pi>0,l<i<t, the number 
of distinct real solutions of A = 0 can be determined by the signs of polyno- 
mials in Pg] let hj (t < j < 1 ) he the polynomials in Pg] we regard every hj 
and fs-i as polynomials in Xg-i, repeat the same argument as what we did 
for fg and gfs, then we get that, under constraints Pi > 0, 1 < i < t, the number 
of distinct real solutions of fg = 0, fg-i = 0 can be determined by the signs of 
polynomials in Pg-i; do the same argument until Pi(pi,p 2 , ■ ■ ■ ,9t) is employed. 
Because the conditions obtained in each step are necessary and sufficient, the 
theorem holds. 

Now, theoretically speaking, we can get the necessary and sufficient condition 
for system TS to have (exactly n distinct) real solution(s) as follows: 

Step 1 Gompute Pi(pi,p 2 , • ■ • ,5t), the set of polynomials in parameters, which 
is defined above for TS. 
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Step 2 By the algorithm of PCAD [2,3], we can obtain Pi-invariant cad D of 
parameter space and its cylindrical algebraic sample (cas) S [24]. Roughly 
speaking, H is a finite set of cells that each polynomial of Pi keeps its sign 
in each cell; and S' is a finit set of points obtained by taking from each cell 
one point at least, which is called the sample point of the cell. 

Step 3 For each cell c in P and its sample point Sc € S, substitute Sc into 
TS (denote it by TS{sc))- Compute the number of distinct real solutions of 
system TS(sc), in which polynomials all have constant coefficients now. At 
the same time, compute the signs of polynomials in Pi ( 51 ,^ 2 , ■ ■ ■ ,gt) on this 
cell by substituting Sc into them respectively. Record the signs of polynomials 
in Pi{gi, g 2 , ■ ■ ■ , gt) when the number of distinct real solutions of system 
TS{sc) equals to the required number n (or when the number > 0, if we 
are asked to find the condition for TS to have real solution) . Obviously, the 
signs of polynomials in Pi{gi, g 2 , . ■ ■ ,gt) on cell c form a first order formula, 
denoted by d>c- 

Step 4 If, in step 3, all we have recorded are <Pci , ■ ■ • , , then V- • -V^cfc 

is what we want. 



5 The Practical Algorithm 

The algorithm given in Section 4 is not practical in many cases since 
-Pi ( 51 1 52 , •■•,<?*) usually has too many polynomials. Because not all those poly- 
nomials are necessary for expressing the condition we want, so we have to give an 
efficient algorithm to select those necessary polynomials from Pi(gi,g 2 , ■ ■ ■ ,gt)- 

Theorem 5.1. Let system TS be given as above. If PolySet is a finite set of 
polynomials in parameters u, e.g. 

PolySet = {qi{u) S Z[ui, . ■ ■ ,Ud]\l < i < k}, 

then, by the algorithm of PCAD, we can obtain a PoZyS'et-invariant cad D of 
parameter space and its cas. If PolySet satisfies that 

1 . the number of distinct real solutions of system TS is invariant in the same 
cell and 

2. the number of distinct real solutions of system PS' in two distinct cells Ci, C 2 
is the same if PolySet has the same sign in Ci, (72, 

then the necessary and sufficient conditions for system TS to have exactly n 
distinct real sohition(s) can be expressed by the signs of the polynomials in 
PolySet. If PolySet satisfies (I) only, then some necessary conditions for system 
TS to have exactly n distinct real solution(s) can be expressed by the signs of 
the polynomials in PolySet. 

Proof. We replace parameters u in TS with each sample point respectively. Be- 
cause D is PoZy5et-invariant and PolySet satisfies (1), we can record the signs 
of polynomials in PolySet and the number of distinct real solutions ot TS on 
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each cell respectively. Choose all those cells on which TS has n distinct real 
solution(s). The signs of polynomials in Poly Set on those cells form a first order 
formula, e.g. 

^ V ^2 V • • • V . 

where each represents the signs of polynomials in PolySet on a certain cell 
on which TS has n distinct real solution(s). We will say that is the condition 
we want. 

Given a parameter a = (ai,...,ad), which must fall into a certain cell, if 
TS{a) has n distinct real solution(s), then a must belong to a cell on which TS 
has n distinct real solution(s), i.e. a must satisfy a certain formula on the 
contrary, if a satisfies a certain formula then, because TS has n distinct real 
solution(s) on the cell represented by <l>i and PolySet satisfies (2), we thus know 
that TS must have n distinct real solution(s) on the cell which a belongs to. 
Thus, the theorem holds. 

Given system TS. For every /j, let 
Ri = Discrim(/i, xi), 

Ri = res(Discrim(/i,Xi),/i_i,/i_2,...,/i), i > 2, 

where the definition of res( ) can be found at the beginning of Section 3. 

For every gj, let 

Rgj = res{gj,fs, fs-i,fs-2, ■ ■ • , /i)- 

We define 

BPs = < i < s} < j < t}. 

Glearly, BPs 'T Pi{gi,g2, . ■ ■ ,gt). 

Theorem 5.2. Given system TS, BPs is defined as above. If we consider only 
those cells which are homeomorphic to and do not consider those cells which 
are homeomorphic to (fc < d) when use PGAD, then BPs satisfies (1) in 
theorem 5.1, so the necessary conditions (if we omit the parameters on those cells 
homeomorphic to R^ (fc < d)) for system TS to have n distinct real solution(s) 
can be expressed by the signs of the polynomials in BPs. 

Proof. By PGAD, we can get a RPs-invariant cad of R'^ and its cas. Because 
we consider only those cells which are homeomorphic to R^, given a cell C, the 
signs of each Ri and Rgj on C are invariant and do not equal to 0. 

First of all, by the definition of R\, the sign of R\ on C is invariant means 
that the number of real solutions of fi{u,xi) is invariant on C; then we re- 
gard f2(u,xi,X2) as a polynomial in X2, because on C, 

fi{u,xi) = 0, i?2 = res(Discrim(/2,X2),/i,a;i) yf 0, 

thus Discrim(/2, 12) 0 on C, i.e. if we replace xi in /2 with the roots of /i, 

the number of real solutions of /2 is invariant. That is to say, the signs of i?i 
and i?2 are invariant on C means the number of real solutions of /i = 0, /2 = 0 
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is invariant on C; now, it’s easy to see that the signs of i?i, i?s are invariant 
on C means the number of real solutions of /i = 0, . . . , /s = 0 is invariant on C . 

Secondly, by the definition of we know that yf 0 means, on C, if we 
replace xi, . . . , Xg in with the roots of /i = 0, . . . , /s = 0, then the sign of g^ 
is invariant. That completes the proof. 

By theorem 5.2, we can start our algorithm from BPs as follows: 

Step 1 Let PolySet = BPs, i = 1. 

Step 2 By the algorithm of PCAD [2,3], compute a PolyS'et-invariant cad D 
of the parameter space and its cylindrical algebraic sample (cas) S [24]. 
In this step, we consider only those cells homeomorphic to and do not 
consider those homeomorphic to R^ {k < d), i.e., all those cells in D are 
homeomorphic to R'^ and all sample points in S are taken from cells in D. 
Step 3 For each cell c in D and its sample point Sc G S, substitute Sc into 
TS (denote it by TS{sc))- Compute the number of distinct real solutions of 
system TS{sc), in which polynomials all have constant coefficients now. At 
the same time, compute the signs of polynomials in PolySet on this cell by 
substituting Sc into them respectively. Obviously, the signs of polynomials 
in PolySet on cell c form a first order formula, denoted by <Pc- When all the 
TS{scYs are computed, let 

sefi = {<l>c\ TS has n distinct real solution(s) on c}, 

seto = {^c| TS has not n distinct real solution(s) on c}. 

Step 4 Decide whether all the recorded <Pc’s can form a necessary and sufficient 
condition or not by verifying whether sefi fl seto is empty or not (because 
of Theorem 5.1 and Theorem 5.2). If seti n seto = 0, go to Step 5; If 
seti n seto 7 ^ 0, let 

PolySet = PolySetU Pi{gi, ■ ■ ■ ,gi), i = i + 1, 
and go back to Step 2. 

Step 5 If seti = {^ci, ■ • ■ , ^c„}, then ^ = <Pci V • • • V is what we want. 

Remark. In order to make our algorithm practical, we do not consider the 
“boundaries” when use PCAD. So, the condition obtained by this algorithm is 
a necessary and sufficient one if we omit the parameters on the “boundaries.” In 
another word, all the conditions obtained by our program DISCOVERER, which 
implements this algorithm, should be understood as follows: if the strict inequal- 
ities hold, the conditions hold; if equalities hold, more discussion is needed. 

Now, we consider following problem: 

Give the necessary and sufficient condition which the parameters u must 
satisfy for the following system PS to have (exactly n distinct) real solution (s) 



PS : 



hi{u,X) = 0, li2{u, X) = 0, . . . , hs{u, X) = 0 
gi{u,X) > 0,g2{u,X) > 0, . . .,gt{u,X) > 0 



( 2 ) 
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where u means ui,U 2 , • ■ • ,Ud^ treated as parameters; X means xi, X 2 ^ ■ ■ Xg, 
treated as variables. That is to say, 

hi,gj & Z{ui,...,Ud)[xi,...,Xs\, 1 < z < s, l<j<t. 

First of all, by Wu’s elimination [25,26], we can reduce the system 
h\{u,X) = 0, h 2 {u,X) = 0, . . . , hs{u,X) = 0 to some “triangular sets.” Then, 
if necessary, by WR algorithm [33,34], make these triangular sets into normal 
ascending chains in which every chain is simplicial w.r.t. each gj (1 < j < t). So, 
under some nondegenerate conditions, we can reduce PS to some TS’s. These 
nondegenerate conditions, however, do not bring any new limitations on our al- 
gorithm because we do not consider “boundaries” when use PCAD and all those 
parameters which make degenerate conditions true are contained in “bound- 
aries.” Another situation we do have to handle is some TS’s reduced from PS 
may have real solutions with dimension greater than 0. In our present algorithm 
and program, we do not deal with this situation and if it occurs, DISCOVERER 
outputs a message and does nothing else. 

6 Examples 

Many problems with various background can be formulated into system PS and 
can be solved by DISCOVERER automatically. 

The calling sequence of DISCOVERER for system PS is: 

tofind([hi, . . . ,/is], [gi, . . . , 3 t], [xi, . . . ,Xs,ui, . . .,Ud],a); 
where a has following three kind of choices: 

— a non-negative integer ly which means the condition for PS to have v distinct 
real solution(s) exactly; 

~ a range {ly, g are non-negative integers, v < g) which means the condi- 
tion for PS to have v or v + \ or ... or g distinct real solutions; 

~ a range v..n {v is a non-negative integer, n a name) which means the condi- 
tion for PS to have v or more than v distinct real solutions. 

Example 6.1. Which triangles can occur as sections of a regular tetrahedron 
by planes which separate one vertex from the other three? 




Figure 1 
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This example appeared as an unsolved problem in the American Mathemat- 
ical Monthly (Oct. 1994) [15]. In fact, it is a special case of the well-known 
Perspective-three-Point (PSP) problem (see the following Example 6.2). 

If we let l,a,b (assume 6 > a > 1) be the lengths of three sides of the 
triangle, and x,y,z the distances from the vertex to the three vertexes of the 
triangle, respectively (see Figure 1), then, what we need is to find the necessary 
and sufficient condition that a, b should satisfy for the following system to have 
real solution(s) 



{ hi = x'^ + y'^ - xy - 1 = 0, 
h 2 = y"^ + — yz — a?' = 0, 

hs = z'^ + x^ — zx — P = 0, 

X > 0,y > 0, z > 0,a — 1 > 0,6 — a > 0. 

With our program DISCOVERER, we need only to type in 



tofind([/ii,/i 2 ,/i 3 ], [x,y,z,a- 1,6- a], [x,y, z,a,b],l..n)] 
DISCOVERER runs 26 seconds on a K6/233 PC with MAPLE 5.3, and outputs 



FINAL RESULT: 

The system has required real solution(s) IF AND ONLY IF 
[R1 > 0, i?2 > 0] or [R1 > 0, i?2 < 0, i?3 > 0] 



where 



R1 = + a + 1 — b^ 

R2 = a'^ -1 + b-b^ 

1 8 n 8,2 16 o 68 2 241 , , 68 ,2 « 

R3=l 6^ -k — a® 6®a^ -k b*a^ 6^ a® 

3 3 9 27 81 27 

68 4 2 68 2 4 ^ ?,6 I 16 8 2 g 46 2 2 

-27'’“ -57'’“ “9'' “9“ +- 9 -‘“ 

16 ,4 16 4 46 ,2 a 46 ,8 2 68 ,g 4 68 ,4 g 

+ -6^+ -a^+ -6V+ -6®a^- -6®a^- -6V 

+ |6iV+ ^6®a"- ^6®a®- 

9 3 9 9 3 3 



+ 512. 0^2 



The article [15] has given a sufficient condition that any triangle with two 
angles > 60° is a possible section. It is easy to see that this condition is equivalent 
to [i?l > 0, i?2 > 0]. 



Example 6.2. Given the distance between every pair of 3 control points, and 
given the angle to every pair of the control points from an additional point called 
the centre of perspectivity (say P), find the lengths of the segments joining P 
and each of the control points. This problem originates from camera calibration 
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and is called perspective-three-point (PSP) problem. The corresponding algebraic 
equation system is called the PSP equation system. 

So-called a solution classification of PSP equation system is to give explicit 
conditions under which the system has none, one, two, . . ., real physical solutions, 
respectively. This problem had been open for many years [16], until [28] appeared 
recently. 

This example is about a special case of PSP problem and was studied in 
different way by Gao and Cheng [16]. Suppose the control points A,B,C form 
an equilateral triangle and ICPA = lAPB, give the solution classification. The 
corresponding equation system is 

( hi = y'^ -\- z'^ - 2yzp -1 = 0, 

I h 2 = z^ -\- — 2zxq —1 = 0, 

I h^ = x'^ -\- y'^ — 2zxq — 1 = 0, 

[x>0, y>0, 2>0, 1— p^>0, 1 — g^>0, p-l-1 — > 0. 

where parameters p, q denote the cosines of the angles 

IB PC, ICPA (= lAPB), 

and X, y, z denote the lengths of the segments PA, PB, PC, respectively. 

By our program DISCOVERER, we need only type in 

tofind([/ii,/i2,/i3], [x,y,z, 1 -/,1 -(?^p + 1 - 2g^], [x,y,z,p,q],0); 
tofind([/ii,/i2,/i3], [x,y,z, 1 -p^, 1 - q^,p + l - 2q^], [x,y,z,p,q], 1); 
tofind ([hi, h 2 , h 3 ],{x,y, z,l - p'^ ,1 - q'^ ,p + 1 - 2q^\, [a;, y, z,p, q],2)-, 
tofind([/ii,/i2,/i3], [x,y,z, l-p^,l-q^,p + l - 2q% \x,y,z,p,q],?,)-, 
tofind ([/ii,/i2,/i3], [x,y,z, 1 -p^,l -g^p + 1 - 2q\ [x,y,z,p,q\,C)-, 
tofind ([/ii,/i2,/i3], [x,y,z, 1 -p^,l -g^p + 1 - 2g^j, \x,y,z,p,q],f>..n)-, 

respectively. After 64 seconds, on a PII/266 PC with MAPLE 5.4, we get the 
solution classification except some boundaries: 

The system has 0 real solution IF AND ONLY IF 

[Rl >0, R2> 0, i?3 < 0, i?4 > 0] 
or 

[Rl < 0, i?2 > 0, i?3 < 0] 

The system has 1 real solution IF AND ONLY IF 

[R2 < 0, i?3 < 0, i?5 > 0] 
or 

[R2 < 0, i?5 < 0] 

The system has 2 real solutions IF AND ONLY IF 
[Rl > 0, i?2 > 0, i?3 < 0, i?4 < 0, R5 > 0] 
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or 



[R1 > 0, R2> 0, i?3 > 0, i?4 > 0, R5 > 0] 



[R1 >0, R2> 0, R4 < 0, i?5 < 0] 

The system has 3 real solutions IF AND ONLY IF 
[R2 < 0, i?3 > 0, i?4 < 0, i?5 > 0] 

The system has 4 solutions IF AND ONLY IF 
[R1 > 0, i?2 > 0, i?3 > 0, i?4 < 0, R5 < 0] 

The system has 5..n real solutions IF AND ONLY IF 
There are not 5..n real solutions in this branch 



R1 = q, 

R2 = 2p- 1, 

RS = 2q- 1, 

R4 = 2p-l-q^, 
R5 = 2pq^ - 3g^ + 1. 



Note that p + 1 — 2q^ > 0, the given condition should hold in all cases. And if 
you want to know the situation when parameters are on a boundary, say R2, 
you need only to type in 



or 



where 




Figure 2 
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In this way, we obtained the complete solution classification, as indicated in 
Figure 2. 

Example 6.3. Solving geometric constraints is the central topic in many cur- 
rent work of developing intelligent Computer Aided Design systems and interac- 
tive constraint-based graphic systems. The existing algorithms do not consider 
whether the given system has real physical solution(s) or not. Our algorithm and 
program can give the necessary and sufficient condition for the system to have 
real physical solution(s). For example: 

Give the necessary and sufficient condition for the existence of a triangle with 
elements a,ha,R, where a,ha,R means the side-length, altitude, and circumra- 
dius, respectively. 

Clearly, we need to find the necessary and sufficient condition for the following 
system to have real solution(s) 

/i = — 4s(s — a){s — b){s — c) = 0, 

/2 = 2Rha - 6c = 0, 

< fs = 2s — a — b — c = 0, 
a > 0,6> 0,c> 0,a-l-6— c> 0,6-|-c— a > 0, 
c -I- a — 6 > 0, i? > 0, /la > 0. 



We type in 

tofind([/i, / 2 , /s], [a,b,c,a + b - c,b + c - a,c + a - b,R, ha], [s, b, c, a,R, ha], l..n); 
DISCOVERER runs 4.5 seconds on a PII/266 PC with MAPLE 5.4, and outputs 

FINAL RESULT: 

The system has required real solution(s) IF AND ONLY IF 
[R1 > 0, i?3 > 0] or [R1 > 0, i?2 < 0, i?3 < 0] 

where 

R1 = R— -a 

1 9 

R2 = Rha — —a^ 

4 

1.9 . 1 9 

i?3 — — —ha + Rha — —a 

In [19], the condition they gave is R1 > 0 A i?3 > 0. Now, we know they are 
wrong and that is only a sufficient condition. 

Our program, DISCOVERER, is very efficient for solving this kind of prob- 
lems. By DISCOVERER, we have discovered or rediscovered about 70 such condi- 
tions for the existence of a triangle, and found three 
mistakes in [19]. 
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Abstract. The approach previously used to mechanise lemmas and Ke- 
pler’s Law of Equal Areas from Newton’s Principia [13] is here used 
to mechanically reproduce the famous Propositio Kepleriana or Kepler 
Problem. This is one of the key results of the Principia in which New- 
ton demonstrates that the centripetal force acting on a body moving in 
an ellipse obeys an inverse square law. As with the previous work, the 
mechanisation is carried out through a combination of techniques from 
geometry theorem proving (GTP) and Nonstandard Analysis (NSA) us- 
ing the theorem prover Isabelle. This work demonstrates the challenge 
of reproducing mechanically Newton’s reasoning and how the combina- 
tion of methods works together to reveal what we believe to be flaw in 
Newton’s reasoning. 



1 Introduction 

The reasoning of Newton’s PhilosophicB Naturalis Principia Mathematica (the 
Principia [14]), as it was originally published, is a mixture of geometric and al- 
gebraic arguments together with Newton’s own proof techniques. These combine 
to produce a complex mathematical reasoning that is used to explain the phys- 
ical world. The demonstrations of Lemmas and Propositions in the Principia 
are, in fact, proof sketches that require a lot of work on the part of the reader 
for a detailed understanding. There are several reasons that make the Principia 
a very difficult text to master. First of all, the proofs are very involved and one 
requires an adequate knowledge of geometry to be able to understand many of 
the steps made by Newton. Secondly, Newton’s exposition can be tedious and 
difficult to grasp in places. Many mathematicians contemporary to Newton, de- 
spite their grounding in ancient Greek geometry and familiarity with the style of 
the exposition, had difficulties understanding Newton’s mathematical reasoning. 
This gives an indication of the demands that a thorough study of the Principia 
has on the modern reader. 

As we mentioned in a previous paper, Newton’s geometry is also notable 
for his use of limit or ultimate arguments in his proofs [13]. These are implicit 
notions of differential calculus that are at the core of Newton’s treatment and 
give Newton’s geometry an infinitesimal nature. Newton further adds motion to 
his reasoning and enriches the geometry with various kinematics concepts that 
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enable points to move towards points for example. Thus, Newton’s geometry 
consists in studying the relations, such as ratios, between various parts of the 
constructed diagrams as certain of its elements tend towards limiting positions 
or become infinitely small. 

In this paper, we build on the tools and techniques that we presented be- 
fore [13]. In section 2, we review the geometric methods and concepts that we 
have formalised in this work. We also give examples of theorems proved in Is- 
abelle using these techniques. Section 3 is a brief introduction to the concepts 
from NSA that we use; it also outlines the infinitesimal aspects of our geometry. 
We then present in Section 4, as a case study, the proof of the Kepler Propo- 
sition. This is a key proof of the Book 1 of the Principia and our work follows, 
in its steps, the analysis made by Densmore [9]. This extended case study shows 
our combination of techniques from geometry and NSA at work to provide a for- 
mal proof of a major proposition. The challenge inherent to the mechanisation 
of Newton’s reasoning- especially in an interactive environment such as Isabelle 
where the user guides the proof- will become obvious as we highlight the steps 
and difficulties encountered. Section 5 offers our comments and conclusions. 

2 Geometry 

We use methods that are based on geometric invariants [5,6] and high level 
geometry lemmas about these invariants. A particular property is ideal as an 
invariant if it ensures that the proofs generated are short. This enables some of 
the proofs to be derived automatically using the powerful tactics of Isabelle’s 
classical reasoner. Also, the methods should be powerful enough to prove many 
properties without adding auxiliary points or lines. The other important aspect 
is to achieve diagram independence for the proofs, that is, the same proof can 
be applied to several diagrams. 

2.1 The Signed Area Method 

In this method, there are basic rules about geometric properties called signed 
areas. These can be used to express various geometric concepts such as collinear- 
ity (coll), parallelism (jj) and so on. Moreover, the basic rules can be combined 
to prove more complex theorems which deal with frequently-used cases and help 
simplify the search process. 

We represent the line from point A to point B by A — B, its length by 
len(A — B), and the signed area SdeitaAiJC of a triangle is the usual notion 
of area with its sign depending on how the vertices are ordered. We follow the 
usual approach of having SdeitaABC as positive if A — B — C is in anti-clockwise 
direction and negative otherwise. Some of the rules and definitions used are: 



a 6 jj C d — (Sdelta ab C — Sdelta ^ b C?) 

coll abc len(a — b) X SdeitaP^c = len(b — c) x SdeitaPa^ 
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We can also introduce new points using the following property and define the 
signed area of a quadrilateral Squad a ^cc? in terms of signed areas of triangles: 



^delta a b C — Sdelta a b d Sdelta a d C Sdelta db C 
^quad abed Sdelta a b C ~\~ Sdelta a C d 

We have proved a number of theorems about the sign of Squad abed that 
depend on the ordering of the vertices, for example Squad a ^cc? = — Squad ad c 6. 

When dealing with geometry proofs, we often take for granted conditions 
that need to be stated explicitly for machine proofs: for example, two points 
making up a line should not coincide. The machine proofs are valid only if 
these conditions are met. These are known as non- degenerate conditions and are 
required in many cases to prevent the denominators of fractions from becoming 
zero in the various algebraic statements. 



2.2 The Full- Angle Method 

A full-angle (w, v) is the angle from line u to line v measured anti-clockwise. We 
define the equivalence relation of angular equality as follows: 

X =a y = G TN. \x — y \ = nn 
and can use it to express that two lines are perpendicular 

7T 

a — bJ-c— d = {a — b, c — d) =a — 

Other properties of full-angles concern their sign and how they can be split 
or joined. The same rule therefore either introduces a new line or eliminates a 
common one from the full angles depending on the direction in which it is used. 



{u,v) =a ~{V,U) 

(u,v) =a (U,x) {x,v) 

Full-angles are used instead of traditional angles because their use simplifies 
many proofs by eliminating case-splits. Moreover, as we have already mentioned, 
these methods are useful to us since they relate closely to the geometric prop- 
erties used by Newton [13]. They preserve the intuitive nature of his geometry 
and can easily be extended with infinitesimal notions, as we will see shortly. 

2.3 A Simple Example: Euclid 1.29 

Euclid’s proposition 29 of Book I [10], can be easily proved using the full-angle 
method. The proposition states that \i A — B || C — D and the transversal P — Q 
intersects A — B and C — D then {A — B, P — Q) = {C — D, P — Q). 
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We prove this theorem easily by using the rules about full-angles given in 
Section 2.2 and the fact the angle between two parallel lines is zero. 

Proof: 

A - B \\ C - D ^ {A- B,C - D) =a0 

{A-B,P-Q) + {P-Q,C- D) =a0 
{A-B,P-Q) =a -{P-Q,C-D) 
{A-B,P-Q) =, {C-D,P-Q). 



2.4 Extending the Geometric Theory 

The main aim of the Principia is to investigate mathematically the motion of 
bodies such as planets. Thus, we need to have definitions for geometric figures 
such as the circle, the ellipse and their tangents. The ellipse is especially impor- 
tant for this work since Kepler’s Problem is concerned with elliptical motion. 
The circle can be viewed as a special case of the ellipse where the foci coincide. 



ellipse /i /a r = {p. |len(/i -p)| -f |len (/2 - p)\ = r} 
circle xr = ellipse x x (2 ■ r) 
arcAenx ab = |len(a; — o)| x {a — x,x — b) 
e_tangent (a — b) E = (is_ellipse /i /2 E A a G E A 

(/i - a,a- b) =a (6 -a, 0 -/ 2 )) 

We need to prove a number of properties relating to the ellipse such as the 
one stating that all parallelograms described around a given ellipse are equal to 
each other (Figure 2). 

This relationship appears (in slightly different wording) as Lemma 12 of 
the Principia where it is employed in the solution of Proposition 11, the famous 
Propositio Kepleriana or Kepler problem. Newton refers us to the “writers on 
the conics sections” for a proof of the lemma. This lemma is demonstrated in 
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Fig. 2. Circumscribed Parallelograms 



Book 7, Proposition 31 in the Conics of Apollonius of Perga [1]. Of course, unlike 
Newton, we have to prove this result explicitly in Isabelle to make it available 
to any other proof that might use it. 

3 Infinitesimal Geometry 

In this section, we give a brief overview of our geometry containing infinitesimals. 
We first give formal definitions for the various types of numbers that exist and 
which can be used to describe geometric quantities. 

3.1 The Nonstandard Universe IR* 

Definition 1. In an ordered field extension IR* A IR, an element x € IR* is 
said to be an infinitesimal if |a;| < r for all positive r S IR; finite if |a;| < r 
for some r G IR; infinite if |a;| > r for all r G IR. 

The extended, richer number system IR* is known as the hyperreals. It has 
been developed in Isabelle through purely definitional means using an ultrapower 
construction. We will not give more details of this substantial construction in 
the present paper so as to concentrate on the geometric aspects only. 

Definition 2. x,y G IR* are said to be infinitely close, x ^ y if \x — y\ is 
infinitesimal. 

This is an important equivalence relation that will enable us to reason about 
infinitesimal quantities. For example, we can formalise the notion of two points 
coinciding by saying that the distance between them is infinitely close to zero. 
Two geometric quantities that become ultimately equal can also be modelled 
using it. The relation and its properties are used to formalise ultimate situations 
that might be considered degenerate by ordinary GTP methods [13]. 

Using the relation, we can also define the concept of two full-angles being in- 
finitely close: 



ai 



tt2 = G IN. [fli — 02 ] 



rnr 
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Various other new properties, such as ultimately similar and ultimately congruent 
triangles, can then be defined as we showed previously [13]. These are then 
used to prove various Lemmas that are needed to demonstrate the important 
Propositions of the Principia. 

4 An Overview of the Proof 

This is Proposition 11 of Book 1 of the Principia. This Proposition is im- 
portant for both mathematical and historical reasons as it lays the foundations 
for Kepler’s first law of Gravitation. It provides the mathematical analysis that 
could explain and confirm Kepler’s guess that planets travelled in ellipses round 
the sun [15]. 

The proof of this proposition will be studied in detail as it gives a good 
overview of the mixture of geometry, algebra and limit reasoning that is so char- 
acteristic of Newton’s Principia. It also gives an idea of the depth and amount of 
mathematical expertise involved in Newton’s proof. The proof that Newton de- 
scribes, though relatively short on paper, becomes a major demonstration when 
expanded and reproduced using Isabelle. The elegance of many of the construc- 
tions, which could be glossed over, are revealed through the detailed analysis. 

We give formal justifications of the steps made by Newton in ultimate situ- 
ations through our formal and logical use of infinitesimals. Infinitesimal reason- 
ing is notorious for leading to contradictions. However, nonstandard analysis is 
generally believed to be consistent and hence ensure that our mechanisation is 
rigorous. We will give the enunciation of the Proposition and the proof (sketch) 
provided by Newton. We will then expand on the sketch and provide detailed 
proofs of the steps that are made by Newton. This will require the use of the 
rules from the geometric and NS A theories developed in Isabelle. 

4.1 Proposition 11 and Newton’s Proof 

Proposition 11 is in fact stated as a problem by Newton at the start of Section 3 
of the Principia. This section deals with ''‘the motion of bodies in eccentric conic 
section^’’ . Particular orbits and laws governing forces that are relevant to the 
universe are investigated. The mathematical tools are developed for later use in 
Book III of the Principia when natural phenomena of our world are investigated. 
Our task consists in expressing Newton’s result as a goal which is then proved. 
Figure 3 shows Newton’s original diagram used for this Proposition. 

Proposition 11 If a body revolves in an ellipse; it is required to find the law of 
the centripetal force tending to the focus of the ellipse 

Newton’s Proof: 

Let S be the focus of the ellipse. Draw SP cutting the diameter DK of the 
ellipse in E, and the ordinate Qv in x; and complete the parallelogram QxPR. 
It is evident that EP is equal to the greater semiaxis AC : for drawing HI from 
the other focus H of the ellipse parallel to EC, because CS, CH are equal, ES, 
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Fig. 3. Newton’s Original Diagram for Proposition 11 



El will he also equal; so that EP is the half-sum of PS , PI , that is (beeause 
of the parallels HI, PR, and the equal angles IPR, HPZ), of PS, PH, whieh 
taken together are equal to the whole axis 2AC. Draw QT perpendicular to SP, 
and putting L for the principal latus rectum of the ellipse (or for ), we shall 
have 



L ■ QR : L ■ Pv = QR : Pv = PE : PC = AC : PC, 
also, L ■ Pv : Cv ■ Pv = L : Gv, and, Gv ■ Pv : Qv'^ = PC^ : CD"^ 

By Corollary 2, Lemma 1, when the points P and Q coincide, Qv^ = Qx^ , and 
Qx^ or Qv"^ : = EP'^ : = CA^ : PP^ and (by 

Lemma 12) = CD^ : CB^. Multiplying together corresponding terms of the four 
proportions, and by simplifying, we shall have 

L QR : QT^ = AC ■ L ■ PC^ ■ CD^ : PC ■ Gv CD^ ■ CB^ = 2PC : Gv, 

since AC ■ L = 2BC^. But the points Q and P coinciding, 2PC and Gv are 
equal. And therefore the quantities L ■ QR and QT^ , proportional to these, will 
also he equal. Let those equals be multiplied by and L-SP^ will become equal 
to — — . And therefore (by Corollary 1 and 5, Proposition 6) the centripetal 
force is inversely as L ■ SP^ , that is, inversely as the square of the distance SP. 
Q.E.I. 

Newton’s derivation concludes that the centripetal force, for a body moving 
in an ellipse, is inversely proportional to the square of the distance. 
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Our proof proceeds in several steps where we set up various relationships 
that we will need for the conclusion. This involves proving Newton’s intermediate 
results and storing them as intermediate theorems (we avoid calling them lemmas 
so as not to confuse them with Newton’s own Lemmas). 

4.2 A Geometric Representation for the Force 

An investigation of the Proposition and Newton’s result indicates that our goal 
is to prove that 3/c S M.. force Ki k x (i.e. force oc ultimately). We now 
demonstrate through a combination of geometric and infinitesimal procedures 
how to prove the theorem. 

Our combination of methods was previously used to prove Kepler’s Law of 
Equal Areas [13]. This is an important result which states that a body moving 
under the influence of a centripetal force describes equal areas in equal times. 
Using this result we can now derive a completely geometric representation for 
the force acting on the orbiting body. 




Fig. 4. Diagram for Geometric Representation of Force 



Consider Figure 4 in which a point P is moving along an arc of finite curvature 
under the influence of a centripetal force acting towards S. Let Q be a point 
infinitely close to P, that is, the length of the arc from P to Q is infinitesimal. QR, 
parallel to SP, represents the displacement from the rectilinear motion (along 
the tangent) due to the force acting on P. QT is the perpendicular dropped to 
SP. From Newton’s Lemma 10, Corollary 3, we have that displacement “in 
the very beginning of motion” is proportional to the force and the square of the 
time, and hence (for some real proportionality constant fci) that 



force 



ki X 



len(Q — R) 
Time^ 



( 1 ) 



Since the distance between P and Q is infinitesimal, the angle {P — S, S — Q) 
is infinitely small, and hence the area of the sector SPQ (SarcSPQ) is infinitely 
close to that of the triangle SPQ: 

(P - S, S - Q) 0 =4> Sarc S P Q « Sdelta SPQ 

=> Sarc S P Q « 1/2 X len(Q - T) X len(S - P) (2) 
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From Kepler’s Law of Equal Areas, we can replace Time by SarcSPQ [13] and, 
hence, using (1) and (2), we have the following geometric representation for the 
force (for some new proportionality constant k) 



force 



len(Q — R) 

len(Q - T)2 x len(S - P)^ 



( 3 ) 



This is a general result (Proposition 6 of the Principia) that applies to any 
motion along an arc under the influence of a central force. We justify the use 
of a circular arc for the general arc by the fact that it is possible to construct 
a circle at the point P that represents the best approximation to the curvature 
there. This circle, sometimes known as the osculating circle^, has the same first 
and second derivative as the curve at the given point P. Thus, the osculating 
circle has the same curvature and tangent at P as the general curve and an 
infinitesimal arc will also be same. We refer the reader to Brackenridge for more 
details on the technique [2,3]. 

With this result set up, to prove the Kepler Problem, we need to show that the 
ratio involving the infinitesimal quantities QR and QT is equal or infinitely close 
to some constant (finite) quantity. Thus, the proof of Proposition 11 involves, in 
essence, eliminating the infinitesimals from relation (3) above. This relation is 
transformed using the geometry of the ellipse to one involving only macroscopic 
(i.e. non-infinitesimals) aspects of the orbit. We show next how the various GTP 
and NS A techniques are applied to the analysis of an elliptical orbit to determine 
the nature of the centripetal force. 



4.3 Expanding Newton’s Proof 

A detailed account of our mechanisation of Newton’s argument for Proposition 11 
would take several pages since the proof sketch given by Newton is complex and 
we would have to present a large number of derivations. We will highlight the 
main results that were proved and, in some cases, details of the properties that 
needed to be set up first. We will also mention the constraints that needed to 
be satisfied within our framework before the various ratios that were proved 
could be combined. Our mechanisation was broken down into several steps that 
roughly followed from Newton’s original proof. The main results that are set up 
are as follows (see Fig. 3): 

~ len(E — P) = len(A — C) 

- len(A - C)/len(P - C) = L x len(Q - R)/L x len(P - v) 

— Lx len(P — v)/(len(G — v) x len(P — v)) = L/len(G — v) 

— len(G — v) X len(P — v) /len(Q — v)^ = len(P — C)^/len(C — D)^ 

- len(Q - v)Vlen(Q - T)^ « len(C - D)Vlen(C - B)^ 

Step 1: Proving len(E — P) = len(A — C) 

from the Latin osculare meaning to kiss- the term was first used by Leibniz 



1 



56 



Jacques D. Fleuriot and Lawrence C. Paulson 



This result shows that the length of EP is independent of P and Newton’s 
proof uses several properties of the ellipse. We will give a rather detailed overview 
of this particular proof as it gives an idea of the amount of work involved in mech- 
anising Newton’s geometric reasoning. Moreover, the reader can then compare 
Newton’s proof style and prose with our own proof and see the GTP methods 
we have formalised in action. 



R 




In Figure 5, the following holds 

— C is the centre of the ellipse with S and H the foci 

— P is a point of the curve 

— RZ is the tangent at P 

— the conjugate diameter D — K || P — Z 

— P — S intersects D — K at E 

— P[ — I \\ E — C and El — I intersects P — S at I 

Since H — I || E — C, the following theorem holds, 

H- I II E- C SdeitaCEI = SdeitaCEH (4) 

But the foci are collinear with and (by Apollonius III. 45 [I]) equidistant from 
the centre of the ellipse; so the following can be derived using the signed-area 
method, 

coll S C H len(S — C) x Sdeita C E H = len(C — H) x Sdeita C S E 

^ Sdeita C E H = Sdeita C S E (5) 



Also, points S, E and / are collinear and therefore combining with (4) and (5) 
above, we verify Newton’s ^^ES,EI will also be equal” 

coll S E 1 len(S — E) x Sdeita C E 1 = len(E — l) x Sdeita C S E 
=y> len(S — E) = len(E — l) 



( 6 ) 
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Next, the following derivations can be made, with the help of the last result 
proving Newton’s “EP is the half-sum of PS, PI” 



coll EIP 



len(E - P) = len(E - 1) + len(l - P) 
len(E - P) = len(S - E) + len(l - P) 

2 X len(E - P) = len(E - P) + len(S - E) -P len(l - P) 
2 X len(E - P) = len(S - P) + len(l - P) 

, , len(S - P) -P len(l - P) 

len(E - P) = ^ 4 ^ ^ 



( 7 ) 



Note the use of the following theorem in the derivation above 

coll SEP len(S - E) -P len(E - P) = len(S - P) 

Next, Newton argues that in fact (7) can be written as 

, , len(S-P) + len(H-P) 

len(E-P) = ^ ^ ^ (8) 

So, a proof of len(l — P) = len(H — P) is needed to progress further. This will 
follow if it can be shown that APHl is an isosceles, that is 

(P-H,H-1) = (H-l,l-P) (9) 



To prove (9), both H — 1 || P — Z and H — 1 || P — R are derived first using 

H-1||E-CAE-C||P-Z=PH-1||P-Z (10) 

H - 1 II P - Z A coll PZR =A H - 1 II P - R (11) 

From (10), (11), and the proof of Euclid 1.29 given in Section 2.3 



1 

1— 1 

1 

tsi 

II 


A (P-H,H-1) 


= (H - P,P - Z) 


(12) 


1 

1— 1 

1 

II 


(H-l,l-P) 


= (R-P,P-1) 






(H-l,l-P) 


= (R- P,P - S) 


(13) 



From the definition of the tangent to an ellipse and the collinearity of P, /, 
and S (also recall that full-angles are angles between lines rather than rays and 
are measured anti-clockwise). 



e_tangent (P 



Z) 



s 



H Ellipse 



(H - P,P - Z) 
(H - P,P - Z) 



(R-P,P-1) 
(R-P,P-S) (14) 



From (12), (13) and (14), the following is deduced as required 
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(P-H,H-I) = 

Thus, we have len(I — P) = len(H — P) (Euclid 1.6 [10]), and hence (8) is proved, 
that is, Newton’s assertion that “[EP is the half sum of] PS, PH". 

Next, it follows from the definition of an ellipse that the sum of len(S — P) and 
len(P — H) is equal to the length of the major axis, that is, 

P e Ellipse len(S - P) + len(P - H) = 2 X len(A - C) (15) 



From (15) and (8), we can finally derive the property that Newton states as 
being evident: “EP is equal to the greater semiaxis AC" 

len(E - P) = len(A - C) (16) 



The first step has shown Newton’s geometric reasoning in action. For the 
next steps, as the various ratios are derived, we will not always show the detailed 
derivations of the geometric theorems. We will concentrate on the setting up of 
the proportions and how everything is put together to get the final result. We will 
state Newton’s Lemmas when they are used and theorems about infinitesimals 
that we use. 

Step 2: Showing ^ ^ ^ 



R 




Fig. 6. Construction for Steps 2 — 4 of Proposition 11 



In Figure 6, in addition to properties already mentioned, the following holds 

— QT ESP 

— QxPR is a parallelogram 

— Q, X, and v are collinear 

— Q is infinitely close to P 



Proving Newton’s Propositio Kepleriana 



59 



It is easily proved that v — x |j C — E and so the following theorem follows 

V - X II C - E (P - V, V - x) = (P - C, C - E) (17) 



From (17) and the fact that APvx and APCE share P as a common vertex, 
it follows that they are similar. Also, since QxPR is a parallelogram, we have 
len(Q — R) = len(P — x). Thus, the following derivations follow 



SIM PVxPCE 



len(P - E) 
len(P - C) 



len(P — x) 
len(P — v) 



len(q - R) ^ len(A- C) 
len(P — v) len(P — C) 



One of the substitution used in (18) follows from (16) proved in the previous 
step. The equations above verify Newton’s ratios. 

Step 3: Showing ^ 

The proof of the ratio 



Lxlen(P-v) ^ ^ 

len(G — v) X len(P — v) len(G — v) 

is trivial and we will not expand on it. We only note that the constant L is 
known as the latus rectum‘s of the ellipse at A. 

Step 4: Showing 

By Apollonius 1.21 [1], if the lines DC and Qv are dropped ordinatewise to the 
diameter PC, the squares on them DC^ and Qv^ will be to each other as the 
areas contained by the straight lines cut off GC , CP, and Gv, vP on diameter 
PC. Algebraically, we proved the following property of the ellipse. 



len(D - C)2 _ len(G - C) x len(C - P) 
len(Q — v)^ len(G — v) X len(P — v) 

_ len(P - C)2 

len(G — v) X len(P — v) 

Rearranging the terms, we get the required ratio, 

len(G — v) X len(P — v) len(P — C)^ 
len(Q — v)2 len(D — C)^ 



^ The latus rectum is defined as L = 2 x len(B — C)^/len(A — C) 
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Step 5: Showing and intermediate ratios 



R 




In Figure 7, we have the additional property, 

- PF ±DK 

Again, it can be easily proved that Qx || EF. The following theorem then follows 
from Euclid 1.29 as given in Section 2.3 



Q - X II E - F : 



(Q - x,x- E) = 
(Q - x,x - T) = 



(F - E,E - x) 
(F - E,E - P) 



( 21 ) 



Since (P — F, F — E, ) = (x — T, T — Q) = tt/ 2 and (21), it follows that APEF and 
AQxT are similar. The next theorems (using (16) where needed) then hold and 
verify Newton’s intermediate results for the current Step. 



SIM PEFQxT 



len(Q — x)^ 
len(Q-T)2 



len(P - E)2 
len(P - F)2 



len(C- A)2 
len(P - F)2 



( 22 ) 



Newton’s Lemma 12 (See Figure 2) is now needed for the next result. According 
to the Lemma, the parallelogram circumscribed about DK and PG is equal to 
the parallelogram circumscribed about the major and minor axes of the ellipse. 
Thence, we have the following theorem 



len(C - A) X len(C - B) = len(C - D) x len(P - F) (23) 



Rearranging (23), we have len(C — A)/len(P — F) = len(C — D)/len(C — B) and 
substituting in (22), leads to 
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len(Q — x)^ len(C — D)^ 
len(Q -T)2 “ len(C - B)2 



(24) 



By Newton’s Lemma 7, Corollary 2, when the distance between Q and P 
becomes infinitesimal as they coincide, we have the following result [13]: 



len(q - v) _ ^ 
len(Q — x) 



(25) 



Now, to reach the final result for this step, we need to substitute len(Q — v) 
for len(Q — x) in (22). However, we cannot simply carry out the substitution 
even though the quantities are infinitely close. Indeed, one has to be careful 
when multiplying the quantities on both sides of the « relation because they 
might no longer be infinitely close after the multiplication. Consider, the non- 
zero infinitesimal e. 



e « but e X 1/e 9^ X 1/e 

It is possible, however, to multiply two infinitely close quantities by any finite 
quantity; the results are still infinitely close. This follows from the theorem: 



X ^ y A u G Finite x x u ~ y x u (26) 

Now, assuming that len(C — D) and len(C — B) are both finite but not infinites- 
imal (for example, len(C — D), len(C — B) e M), then len(C — D)/len(C — B) 
is Finite. Hence, the ratio of infinitesimals len(Q — x)/len(Q — T) is Finite. 
Therefore, from (25), (26), and using (24) the following theorem is derived: 

len(q - v)^ _ len(C-D)^ 

len(q-T)2 len(C-B)2 ^ ^ 

This gives the result that we wanted for the fifth step of the proof of Proposi- 
tion 11. We are now ready for putting all the various results together in the next 
and final step. This will then conclude the formal proof of the Proposition. 

Step 6: Putting the ratios together 

Combining (20) and (27), with the help of theorem (26) and some algebra 
yields. 



len(q — v)^ 
len(q-T)2 



len(C — D)^ len(G — v) x len(P 
len(C — B)2 len(q — v)^ 

len(G — v) X len(P — v) 



v) 

— G Finite 
len(P - C)2 



len(q-T)2 



len(C-B)2 



( 28 ) 
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which is combined with (19) to derive the next relation between ratios. The 
reader can check that both sides of the « relation are multiplied by finite quan- 
tities ensuring the results are infinitely close: 

Lxlen(P-v)^ len(P-C)^xL 
len(Q - T)2 len(C - B)2 x len(G - v) ^ ^ 

The next task is to combine the last result (29) with (18) to yield the follow- 
ing ratio which is equivalent to Newton’s “L • QR : QT'^ = AC ■ L ■ PC^ ■ CD^ : 
PC-Gv CD^ ■ CB'^" 



L X len(Q — R) len(P — C) x L x len(A — C) 
len(Q — T)2 len(C — B)^ x len(G — v) 



(30) 



But, we know that L = 2 x len(B — C)^/len(A — C), so (30) can be further sim- 
plified to give Newton’s other ratio “L • QR : = 2PC : Gu” 



L X len(Q — R) 2 x len(P — C) 
len(Q — T)^ len(G — v) 



(31) 



Once these ratios have been derived, Newton says “But the points Q and P 
coinciding, 2PC and Gv are equal. And therefore the quantities L ■ QR 
and QT^i proportional to these are also equal.” 

We formalise this by showing that len(P — v) « 0 as the distance between Q 
and P becomes infinitesimal; thus, it follows that 2 x len(P — C)/len(G — v) « 1 
and so, using (31) and the transitivity of w, we have the result 



L X len(Q — R) ^ 
len(Q-T)2 



(32) 



The final step in Newton’s derivation is “Let those equals be multiplied 
by and L ■ SP^ will become equal to ” • This final ratio gives 

the geometric representation for the force, as we showed in Section 4.2, and 
hence enables Newton to deduce immediately that the centripetal force obeys 
an inverse square law. 



We would like to derive Newton’s result in the same way, but remark that 



len(S — P) € Finite — Infinitesimal 
len(S-P)^ 



len(Q — R) 



G Infinite 



len(Q — R) S Infinitesimal 

(33) 



as Q and P become coincident. So, there seems to be a problem with simply 
multiplying (32) by Newton’s ratio SP^/QR since we cannot ensure that the 
results are infinitely close. Our formal framework forbids the multiplication that 
Newton does as the result is not necessarily a theorem! 
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Therefore, we need to find an alternative way of arriving at the same result 
as Newton. Recall from Section 4.2, that we have proved the following geometric 
representation for the centripetal force: 



force 



, len(Q-R) 1 

k X 7 -7 X 7 -7 

len(Q-T)2 len(S - P)^ 



(34) 



Now from (32), we can deduce that since L S Finite — Inf initesimal, the 
following theorems hold 



len(Q — R) 
len(Q - T)2 
len(Q - T)2 
len(Q — R) 
len(Q-T)2 
len(Q — R) 



€ Finite — Infinitesimal 
« L 

G Finite 



(35) 

(36) 

(37) 



Since (35) holds and l/len(S — P)^ S Finite, it follows that force € Finite and 
so we can now use the following theorem about the product of finite, infinitely 
close quantities 

a ~ b A c«d A a € Finite A c G Finite a x c ~ b X d 



with (34), (36), and (37) to yield 



force X L 



, len(Q-R) 1 len(Q - T)^ 

^ len(Q-T)2 ^ len(S - P)^ ^ len(Q - R) 



k X 



1 

len(S - P)2 



(38) 



Note that we also used the fact that « is symmetric in the derivation above. 
Finally from (38), we get to the celebrated result since L is finite (real) and 
constant for a given ellipse. 



force 



k _ 1 

L ^ len(S-P)2 
1 



force CXultimate 



len(S-P)2 



(39) 
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5 Final Comments 

We would like to conclude by mentioning some important aspects of this mech- 
anisation and possible changes to the geometry theory that could improve au- 
tomation. We also briefly review what we have achieved. 

5.1 On Finite Geometric Witnesses 

We have made an important remark about steps involving infinitesimals, ratios of 
infinitesimals and the infinitely close relation. Whenever we are dealing with such 
ratios, care needs to be exercised as we cannot be sure what the result of dividing 
two infinitesimals is: it can be infinitesimal, finite or infinite. We notice, when 
carrying out our formalisation, that whenever Newton is manipulating the ratio 
of vanishing quantities, he usually makes sure that this can be expressed in terms 
of some finite quantity as in the proof for Step 5 of Section 4.3. Thus, the ratio of 
infinitesimals is shown to be infinitely close or even equal to some finite quantity. 
This ensures that such a finite ratio can be used safely and soundly within our 
framework. The importance of setting up such finite geometric witnesses cannot 
be under-stated since the rigour of NSA might prevent steps involving ratios of 
infinitesimals from being carried out otherwise. Indeed, we have seen that the 
lack of a finite witness in the last step of Newton’s original argument prevents 
us from deriving the final result in the same way as he does. The alternative way 
we went about deriving the result, however, is sound and follows from rules that 
have been proved within our framework. 

5.2 Further Work 

In our previous work [13], we mentioned the existence of other methods, such 
as the Clifford algebra, that provide short and readable proofs [4,11]. Although 
these algebraic techniques are more difficult to relate with the geometric concepts 
that are actually used in Newton’s reasoning, interesting work done by Wang et 
al. has come to our attention in which powerful sets of rewrite rules have been 
derived to carry out proofs in Euclidean geometry [12,16]. It would be interesting 
to see how these could be integrated with Isabelle’s powerful simplifier to provide 
a greater degree of automation in some of our proofs. In a sense, such an approach 
would match in the level of details some of the results that Newton states (as 
obvious) and does not prove in depth. 

As an interesting observation, it is worth noting that the Kepler Problem 
can be proved, or even discovered, using algebraic computations. This has been 
demonstrated through the work on mechanics done by Wu [17], and also by Chou 
and Gao [7,8], in the early nineties. 

5.3 Conclusions 

We have described in detail the machine proof of Proposition 11 of the Principia 
and shown how the theories developed in Isabelle can be used to derive Newton’s 
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geometric representations for physical concepts. We have used the same combi- 
nation of geometry and NSA rules introduced in our previous work to confirm, 
through a study of one of the most important Propositions of the Principia, that 
Newton’s geometric and ultimate procedures can be cast within the rigour of our 
formal framework. The discovery of a step in Newton’s reasoning that could not 
be justified formally- in contrast with other ones where Newton explicitly sets 
up finite witnesses- is an important one. The alternative derivation presented in 
this work shows how to use our rules to deduce the same result soundly. 

Once again, the mechanisation of results from the Principia has been an in- 
teresting and challenging exercise. Newton’s original reasoning, though complex 
and often hard to follow, displays the impressive deductive power of geometry. 
The addition of infinitesimal notions results in a richer, more powerful geometry 
in which new properties can emerge in ultimate situation. Moreover, we now 
have new, powerful tools to study the model built on Newton’s exposition of the 
physical world. 
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Abstract. Readability is a fuzzy concept and there are several levels of 
readability. Readable machine solving avails the development of mathe- 
matics culture and is useful to education. There are several methods of 
readable machine solving, such as the logic method, points elimination 
method, geometry information searching system or deductive database 
method. Based on these methods, some new types of educational soft- 
ware have been developed. As an example, the intelligent educational 
software “Mathematics Lab: Solid Geometry” (MSG) is introduced in 
this paper. 



1 Introduction 

Because of various reasons, machine proving of geometry theorems is one of 
the main focuses of automated reasoning. Since the pioneering work of Wu [28] 
in 1977, highly successful algebraic methods for solving geometry problems have 
been developed. For most geometry statements, one can assert whether it is true 
or false by a computer system using these methods within seconds [1]. 

These algebraic methods generally can only tell whether a statement is true 
or not. If one wants to know the proving process, one usually has to look at 
tedious computations of polynomials. In other words, the solution produced by 
algebraic methods is generally not readable. 

Researchers have been studying automated generation of readable solving 
using computer systems since the work by Gelernter and others [20]. With a 
great amount of efforts in the past 40 years, especially with the new techniques 
developed in recent years [31,2,29,10], the advance in this direction is noticeable. 
There are now two classes of methods, the points elimination methods and the 
geometry deductive database methods (abbr. GDD method), to produce read- 
able solutions for geometric problems. Using these methods, about one thousand 
non-trivial geometric problems were solved by computer programs and most of 
the solutions are short and readable. 

The research on readable machine solving is useful in education. Two pieces of 
IGAI software named Geometry Experts (abbr. GEX) [17,18,16] and Mathematics 
Lab: Solid Geometry (MSG) are developed based on this work. Now both are used 

* This work was supported in part by the Chinese National Science Foundation, the 
863 Foundation and DOES company. 
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in about 400 middle schools in China, and were warmly welcomed by teachers 
and students. 

This paper reviews recent advances on readable machine solving and then 
introduces the ICAI software MSG. In Section 2, we give an introduction to 
the readable machine solving methods. In Section 3, we introduce the points 
elimination methods with some examples. In Section 4, we explain GDD (GISS) 
methods briefly. Section 5 introduces the structure and working principle of 
software MSG, and lists its functions. In the last Section, we give some comments 
on future development and propose some problems for future research. 

2 What Is Readable Solving and Why Readability Is 
Wanted 

By readable solving we mean algorithms that not only find the solutions of certain 
problems but also generate a human readable solving process such that one can 
check why the solution is true and how to get to the conclusion. 

Gertainly, readability is a fuzzy concept. Generally, if a solving process is very 
long, it will be difficult to check and understand. So we may say that a very long 
solution is not readable. But we cannot determine exactly how long is very long 
or how short is readable. Moreover, there are several levels of readability. A proof 
for a geometry theorem may be readable to a mathematician but not to a student 
in middle school. Therefore, we use the word readable with a different meaning 
in different situations. For instance, while talking about the education, readable 
solving means that it can be checked and understood easily by most students. 
There are two obvious reasons for seeking readability in problem solving: 

1 Readability avails the development of mathematics culture. Mathematics is 
not only a science field but also a culture. The development of mathematical 
culture needs the support from most people. If people were told only the 
mathematical conclusion but can not check and understand the process, 
they will lose their interest in mathematics. 

2 Readability is useful to education. While the students study mathematics, 
they mainly learn the ideas, principles and methods. So they should under- 
stand why a conclusion is true and how to get it. In our viewpoint, readable 
machine solving can play an important role in IGAI (Intelligent Gomputer 
Aided Instruction). 

Because it is very difficult to write programs to produce readable solutions for 
various mathematics problems, most mathematics software can only give the fi- 
nal conclusion. The research of readable machine solving, especially in geometry, 
is a very attractive field for a long time to come. 

3 A Sketch of the Point Elimination Method 

In 1992, a new idea for automated geometry solving was proposed and imple- 
mented [31,2]. It is called the point elimination method. The program based on 
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the method can produce short and readable proofs for most non-trivial geome- 
try theorems. Roughly speaking, this approach is to represent the hypotheses in 
a statement in a constructive way and to describe the conclusion of the state- 
ment by equations of geometric quantities. The proving process is to eliminate 
geometry objects from the conclusion by using a few formulae according to the 
constructions and the geometric quantities. We will use two examples to illustrate 
this idea. 

We will use capital English letters to denote points. We denote by AB the 
signed length of the oriented segment from Ato B and its absolute value by AB. 
The signed area Sabc of triangle ABC is the usual area with a sign depending 
on the order of the three vertices of the triangle. About the relationship between 
the area and the length, there is a very useful tool: 



Proposition 1 (The Co-side Theorem). Let M he the intersection of two 
non-parallel lines AB and PQ and QAM. Then £M= = 

^ - 1 . - 1 . / Qj^j bQAB 



The co-side theorem is often used to eliminate the intersecting point of two 
lines. This will be shown clearly in the following example. 



Example 1. Let O he the intersection of the two diag- 
onals AC and BD of a parallelogram ABCD. Show 
that O is the midpoint of AC, or == = 1. 




Analysis. To show = 



1, by using the Co-side Theorem we have 



AO _ Sabd 
OC Sbcd 

Point O is thus eliminated from the conclusion. Now the problem becomes to 
show Sabd = Sbcd- Because AB |j CD and AD || BC, we have 



Sabd = Sabc, Sbcd = Sabc- 



So point D is also eliminated and now it is evident that the conclusion is valid. 

It is now possible to get some basic idea about the Points Elimination method 
from the above example: 

(1) The geometric quantities. Here they are ratios of signed lengths and signed 
areas. We use the algebraic expressions in geometric quantities to represent 
the goals of problems. For more problems, more geometric quantities would 
be introduced. 

(2) The constructions. In example 1, we can use the constructive process to 
represent the conditions of the problem. Points are introduced by construc- 
tions in the order: A,B,C,D,0] and are eliminated in the reverse order: 
O, D, A, B, C. Of course, we can use more kinds of constructions to enlarge 
the class of problems. 
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(3) The formulae. To eliminate a point, formulae are needed. The main task in 

building a proving system is to choose the proper formulae for eliminating 

points. 

Let us use the idea of Eliminating Points to solve another problem: 

Example 2. M is the midpoint of the side BC of triangle ABC and N is on the 
side AC such that AC = 3JVC. Point P is the intersection of lines AM and 
BN. Find the ratio =4£=. 

PM 

For the problem, the constructive process can be: 

Cl Take any three non-collinear points A,B and C. 

C2 Draw point M as the midpoint of segment BC. 

C3 Take point N on segment AC such that AC = 3NC. 

C4 Draw point P as the intersection of two lines AM and BN. 

Solution by Points Elimination method 

== = (To eliminate P by Co-side Theorem) 

= (To eliminate M) 

2 ~ 

= 2^= =2*2 = 4 (To eliminate TV). 

Another important question is how to select the formula to eliminate a given 
point. The answer depends on what kind of geometric quantity the point ap- 
peared in and which construction the point was introduced by. As an instance, 
if point P is introduced by construction intersection of two lines and appeared 
in geometric quantity ratio of two segments, then it can be eliminated by using 
the co-side theorem. Therefore, if m kinds of geometric quantities and n types 
of constructions are used, the number of formulae needed for eliminating points 
should be about the nm. 

Based on the idea of eliminating points, we can develop various methods 
for geometry solving by choosing different geometric quantities, constructions 
and formulae for eliminating points or other geometry objects. In fact, a series 
of algorithms based on this idea have been developed. The area method for 
Euclidean plane geometry was first proposed in [31] and improved in [3,2,6]; the 
volume method for solid geometry was given in [8] . To get more readable solutions 
for some classes of problems, the full-angle method [5], the vector method and 
complex number method [4,7,6,26] were developed. The eliminating method for 
non-Euclidean geometry, by which many new theorems have been discovered, 
was established later in [29]. In the following, we sketch only the methods for 
Euclidean plane geometry. They are the area method, the vector or complex 
number method and the full angle method. 

More recently, the points elimination method has been extended to the classes 
of geometry problems in which the geometric quantities can be represented as 
elements in the Clifford Algebra [22,27,14,30]. Using the extended Points Elim- 
ination method, a computer program has been developed, by which one can 
solve problems involving lines and conics and the solutions are generally short 
and readable [30]. 
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Among all the above methods, the area method is the most useful one. It is 
a complete method for constructive propositions in Euclidean plane geometry. 
In the area method, we need three geometric quantities: the sign areas, ratios of 
collinear or parallel oriented segments and a new quantity named Pythagorean 
Difference. The Pythagorean Difference is used to treat geometry problems in- 
volving perpendiculars or angles. For points A, B, and C, the Pythagorean dif- 
ference Pabc is defined to be Pabc = AB CB — AC . 

For four points A, B, C and D, the notation ABACD implies that one of 
the following conditions is true: A = B, or C = D, or line AB is perpendicular 
to line CD. For a quadrilateral ABCD, let Pabcd = Pabd — Pcbd = AB -\- 
CD —BC —DA . Then, the Pythagorean theorem can be generalized as follow. 



Proposition 2. ACABD iff Pabd = Pcbd or Pabcd = 0. 

As an application of the above proposition, we give: 

Example 3 (The Orthocenter Theorem). Let the two alti- 
tudes AF and BE of triangle ABC meet in H . Show that 
CHEAB. 



Proof. By Proposition 2, we need only to show Pach = Pbch- Since BHEAC 
and AHEBC, by Proposition 2 we can eliminate point H: Pach = Pacb] 
Pbch = Pbca- Thus we have the proof: 

Pach _ Pace _ Pace _ ^ 

Pbch Pbca Pace 



In the construction process, we always use several introduced points to decide 
a new point. Such as, if points A,B,C ,D are known, we can get the intersection 
of lines AB and CD. All of the secrets to eliminate points are how to represent 
the position of a new point with that of the old ones. By the co-side theorem, 
we use the ratio of areas to represent the position of intersection. 

Now we are going to explain the vector method or complex number method 
briefly. For each point P in the plane, let be the corresponding vector from 
the origin to P or complex number, and P the conjugate number of Then 
we denote by aS = — A the vector from point A to point B. 

The inner product of vectors A)§ and C3 is denoted by (A)§,Cl5), and the 
exterior product by [AS, C15]. 

We can define the signed area of a triangle ABC as 



Sabc 



[Ag,AS] 

2 



Then the exterior product [AZ?, Cl5\ is actually the signed area or the quadri- 
lateral ACBD, i.e.. 



[Ag,cA] 



Sacbd = Sacd — Sbcd 



2 
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Then we have 

Sabc = = i((S - A)(d -1)(C - i)). 

Pabc = 2{b 1, W) = 0- ~A){C -A) + {B- A)(d - ~1). 

Therefore, the area method for Euclidean geometry can be translated into 
the language of vectors or complex numbers. The solutions thus produced are 
generally longer than those produced by the area method. But the vector or 
complex method is still useful for solving the problems in some classes. Because 
this method has two advantages: 

( 1 ) The vectors or complex numbers themselves can be eliminated as geometric 
quantities directly. 

(2) The vectors or complex numbers are useful to represent the relationships 
about angles or similar triangles. 

We give an example for vector method: 

Example 4 (Cantor’s Theorem). Suppose A,B,C,D are on a circle with cen- 
ter O and E,F,G be the midpoints of CD,AB and AD respectively. If N be the 
intersection of the lines through E parallel to OF and through F parallel to OE, 
then the conclusion can be represented by 

{{gS,W) = {N^,B(J))) 



(i.e. GNEBC) 



Here is the machine proof by the vector method. 

(BdBd) 

{W,bU) 



The Conclusion. 



(W,b6)i 

{Bd.wd)-! 

i{W,W)-{W,W)+i/2{W,B3)+i/2{CS,cS))-i 

{2y{Bd,Bd)i 

{-2)-{1/2{B^.Wi5) + 1/2{WA.WJ))-1 

-{{W,Bt>)+(BA^))-(2) 

{-2{Bd,B3)-2{Bl,Bd))~i 



51. Elim. point N. 

52. Elim. point E. 

53. Elim. point F. 

54. Elim. point G. 

55. Elim. point O. 

S6. Simplify. 



For solving geometric problems about angles, using angle as the main ge- 
ometric quantity directly will be better than using both area and vector. We 
use a new kind of angle called full-angle, which is more convenient to machine 
solving. Intuitively, a full-angle Z[m, v] is the directed angle from line u to line v. 
Two full-angles Z[/, m] and Z[m, ?;] are equal if there exists a rotation K such that 
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K{1) II u and K{m) || v. For the geometric meaning of the addition of full-angles, 
let l,m,u, and v be four lines and K be a rotation such that K{1) || v. Then 
l[u, ri] -I- Z[Z, to] = l[u,K{m)]. 

For all parallel lines AB || PQ, Z[0] = l[AB, PQ] is a constant. For all 
perpendicular lines ABPPQ, Z[l] = /.[AB, PQ] is a constant. It is clear that 
Z[l] -I- Z[l] = Z[0] and /[u,v] -b Z[0] = /[u,v\. 

Using full-angle method, many geometry properties can be described con- 
cisely. For instance, we have: 

Proposition 3. FI For any line UV , /[AB,CD] = /[AB,UV] + /.[UV,CD], 

F2 PQ is parallel to UV iff /[AB, PQ] = L[AB, UV] for any point A y/ B . 

F3 PQ is perpendicular to UV iff L[AB,PQ] = Z[l] -b L[AB,UV]. 

F4 For three non-collinear points X, A, and B, XA = XB iff /[AX,AB] = 
L[AB,XB], 

F5 (The Inscribed Angle Theorem) Four non-collinear points A, B, C , 
and D are cyclic iff /[AD, CD] = /[AB,CB], 

For ordinary angles, above propositions are true only under certain additional 
conditions like “a line is inside an angle” , “two points are on the same side of a 
line”, etc. 

Even using the above simple properties of full-angles, we can produce short 
solutions for many difficult geometry problems automatically. The technique of 
eliminating points is now extended to that of eliminating lines. For instance, if 
PQ is perpendicular to UV, by F3 we can eliminate line PQ from full-angle 
/[AB, PQ]\ /[AB,PQ] = Z[l] -b /[AB,UV]. We illustrate the method with an 
example. 



Example 5 (Miquel’s Axiom). ^ If four circles are 
arranged in sequence, each two successive circles in- 
tersecting, and a circle pass through one pair of each 
such pair of intersections, then the remaining inter- 
sections lie on another circle. Figure 1-3 

Let us assume that the four circles are ABFE, BCGF, CDHG and DAEH. 
Show that if points A, B, C and D are cyclic then points E, F, H and G are 
also cyclic. 

The Machine Proof 
Z [hg,he]-/ [FG,FE] 

{/[HG,HE] = /[HG,HD]-i-/[HD,HE] = /[GC,DC]-/lEA,DA], 

because cyclic(G, h, d, c), cyclic(£;, h, d, a). (FI, F5) ) 

= -/[fG,FE] + /[GC,DC]-/[EA,DA] 

^ All previous methods failed to prove this theorem automatically by far due to com- 
puter system memory limitation. 
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{/.[fG,FE] = /.[FG,FB] + /.[FB,FE] = /.[GC,CB]-/.[EA,BA], 
because cyclic(G, f, b, c), cyclic(F, f, b, a). (FI, F5) ) 

= 1[GG,DG]-1[GG,GB]-1[EA,DA] + 1[EA,BA] 

{L[gc,dc]-/.[gc,cb]=-1.[dg,gb]. (FI) ) 

= —1.[eA,DA] + /.[EA,BA]-/.[DG,GB] 

{ — /.[eA,DA] + /.[EA,BA] = /.[DA,BA]. (FI) ) 

= -L[dG,GB] + /.[DA,BA] 

{L[dC,CB] = /.[DA,BA], 

because cyclic ( c, f, b, a). (F5) ) 

= Z[0] 

The full-angle method as reported here is not complete. By using the area 
method, we can give a complete method for proving theorems involved full- 
angles. Full-angles are also useful in GDD method, which will be mentioned in 
next Section. 



4 The Geometric Information Searching System 

To obtain proofs in a more traditional style, the algorithm based on the search 
method has been proposed in [9,10]. It is called GDD (Geometry Deductive 
Database) method or GISS (Geometry Information Searching System) method. 
For a given geometric configuration, using GDD method, one can get the infor- 
mation bank containing all the messages about the figure that can be deduced 
from the built-in reasoning rules. It is meaning that the fixpoint 

The work uses the synthetic approaches are closely related to GDD method. 
In [23], Nevins used a combination of forward chaining and backward chaining 
with emphasis on the forward chaining. But before [9], all previous synthetic work 
deals with theorems involving straight lines only and the fixpoint is not reached. 
The idea of adding auxiliary points and the problem of including circles into the 
program are discussed respectively in [25,24] and [23] but both not implemented. 

The program based on GDD has been tested with 160 geometry configura- 
tions ranging from well-known geometry theorems such as the centroid theorem, 
the orthocenter theorem, and Simson’s theorem to problems recently proposed 
in the problem section of the American Mathematical Monthly. The program 
not only finds most of the well-known properties of these configurations but also 
often gives many unexpected results. As is known, most of the 160 theorems can 
not be proved by the previous programs based on synthetic approaches. 

The basic ideas behind the GDD method, such as fixpoints and the treat- 
ment of negative clauses, come from the deductive database theory [15]. But the 
strength of GDD method is mainly based on the following improvements: 

(1) We propose the idea of structured deductive database. In traditional de- 
ductive databases, each n-ary predicate is associated with an n-dimensional 
relation. This is not suitable for geometry for two reasons: the excessively 
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large database and repetitive representation of information. On average, the 
databases for the 160 tested geometry configurations would be of size 242,117 
if using the traditional representation. This problem was solved in ODD by 
using some simple mathematical structures such as sequences and equivalent 
classes to represent facts in the database. The size of the structured database 
is one thousand times smaller. 

(2) We propose a new data-hased search strategy . ODD keeps a list of “new 
data” and for each new data the system searches the rule set (or intensional 
database) to find and apply the rules using this data. If using the data-based 
strategy, the redundant deductions caused by repeated application of a rule 
to the same facts will be automatically eliminated. 

(3) Selecting appropriate rules. All the previous work based on the synthetic 
approach [20,23] uses geometric rules about the congruent triangles as its 
basic geometric rules. Most of the basic results in geometry such as the cen- 
troid theorem, the orthocenter theorem, and Simson’s theorem are beyond 
the scope of these rules without adding auxiliary points. 

(4) Adding of auxiliary points. In GDD, about twenty rules of adding auxil- 
iary points are implemented and thirty nine of the one hundred sixty con- 
figurations solved by the program need auxiliary points. This is the first 
implementation of a nontrivial set of rules of adding auxiliary points. 

(5) Numerical diagrams of geometry statements are used as models to deal with 
the negative information in the rules. The program can construct the diagram 
automatically for a class of constructive geometry statements. In all the 
previous work using diagrams as models [20], the diagrams are constructed 
by the user. 

Here we give an example for the GDD method: 

Example 6. Let the three altitudes of a triangle ABC are AD,BE,CF. Also 
point D is on the segment BC. Show L[FDA\ = L[ADE], 

The conclusion can be represented by full-angles in form 

L[DF,DA\ = L[DA,DE\ 

The machine proof by GDD method is like the following: 

(1) L[DF,DA\ = l.[DA,DE\ (by (2) and DEED A). 

(2) L[DE,DB] = L[DB,DF] (by (3)+(4)). 

(3) L[BD,BF\ = ^[ED,EC] (by (5)). 

(4) l[BF,DF] = l[EC,CD] (by (6)). 

(5) A, D, B, E are concyclic (by DEED A and EBEEA). 

(6) A,D,C,F are concyclic (by DC EDA and FCEFA). 

By the GDD method, the computer system not only obtains the conclusion, 
but also produces a lot of informations of the figure, such as: 24 pairs of equal 
angles, 7 sets of similar triangles and 30 pairs of equal ratio of segments. 
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5 Educational Software MSG 

There are several methods to produce readable solving for geometry problems. 
Every method has specific advantages. For the same theorem, one can get proofs 
with different styles by using different methods. Using the above method, about 
one thousand non-trivial geometry problems (most of them can be found 
in [2,12,13,11]) were solved by computer programs and most of the solutions 
are readable and short. 

These automated reasoning methods for readable machine solving are mature 
enough to be used in ICAI. Using these methods, we can make our educational 
software more intelligent and attractive. How to use automated reasoning meth- 
ods to develop educational software? What would be the educational software 
with automated reasoning like? The intellective educational software MSG [21] 
may give us some explanation. 

The full name of MSG is Mathematics Lab: Solid Geometry. It is one of a 
series educational software named Mathematics Laboratory. To fit the request of 
teachers and students in middle schools and high schools, the working style of 
MSG is more traditional than GEX. It means that, unlike the volume method for 
solid geometry in [8] , we use only quantities that appeared in the geometry text- 
book. The program of MSG consists mainly of four parts. They are constructing 
tools, information boxes, reasoning rules and the solving controls. 

The constructing tools allow users not only to draw solid geometric figure, 
but also send the constructing process to information boxes. Based on the infor- 
mation, MSG can do graphic translating and reasoning for solving. 

A deductive database consists of information boxes. Every box is for saving 
one kind of information. There are four kinds of angles, six kinds of distances, 
three kinds of perpendicular and three kinds of parallel in solid geometry. So the 
database is more complex than that in plane geometry. 

We use about 220 reasoning rules. A rule may be one axiom, lemma, theorem, 
formula, definition or skill for solving. They are chosen from plane and solid 
geometric textbooks in middle schools and high schools in Ghina. When a solid 
figure was drawn and a problem was proposed, MSG can do reasoning by using 
part or all of the rules to get new information about the figure. Of course, new 
information will be put into the database. The process will stop until a solution 
is obtained. 

The solving controls can select or remove rules used in reasoning. This is 
mainly according to the knowledge level of students or the request by user. 
When automated solving failed, the controls may give, by an expert knowledge 
bank, some suggestions about how to add auxiliary points and auxiliary lines. 

We will show the functions of MSG using the following example. 

Example 1. ABCD is a square, EA _L planeABCD, AF _L BE, F is the foot 
from A to line BE and G the foot from F to line CE. Show that AG _L EC. 
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FUNCTION 1. Dynamical Construct and Generating Problems 

To solve a problem by computer system, it is necessary to make the envi- 
ronment in which the user can describe his problems conveniently. After the 
software MSG was started, the construction menu or toolbar can help the user 
to draw various solid figures by mouse and to prepare proposing problems. This 
is the construction of our example: 

1. Take two points A,B in space. 

2. Draw a square ABCD. 

3. Add segments EA,EB,EC. 

4. Take a point E at the line through A and perpendicular to plane ABC. 

5. Draw the foot F from point A to line BE. 

6. Draw the foot G from point F to line EC. 

7. Add segment AG. 

Now the figure you constructed is as the following figure. 





Suppose: 

BAD C is a square. AElplaneBCA. AFIBE. F 
IS the foot, FGI CE, G is the foot. 

Show that GAl EC. 



Figure 2-1 



While you drag a free point by mouse, the graph would change, but it still 
keeps the geometry properties given during constructing. The program will auto- 
matically remember the constructing procedure. These graphs are dynamical and 
can be changed. They can always keep their properties given by the constructing 
procedure. In other words, they have intelligence and know the characteristics 
they should have. When a graph is constructed, you can add a conclusion such 
that some problems will be generated. In the above example, we should add the 
conclusion: AG _L EC (Figure 2-1). 

FUNCTION 2. Automated Reasoning and Alternate Solving 

Based on the basic properties of the graph and the additional conditions, 
using a series of rules (axioms, definitions, theorems and formulae), the program 
can find many geometric properties of the graph and solve the problems proposed 
by user. For this example, the program will automatically get the conclusion 
AG _L EC (Figure 2-2) 
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Suppose: 

BADC is a square. AElplaneBCA. AFIBE. F 
is the foot, FG Ice. G is the foot. 

Show that GAl EC. 

Solution: 

AFIeFB (known) (0 

BADC is a square (known) 

AElplaneBCA (known) 

AEiCB (by rule 13 and (3)) W 

CB 1 plane AFBE (by rule 20 and (2) (4)) (^) 

CBIAF (by rule 13 and (5)) 

AF 1 plane FBEGC (by rule 20 and ( 1) (6)) 

GFlEGC (known) 

AGIEGC (by rule 14 and (7) (8)) 



Rule 1 4 :( Three-perpendicular-line Theorem) If a line 
in a plane is perpendicular to the projection of one oblique 
line of this plane, then the line is also perpendicular to 
the oblique line. 



Figure2-2 





aca, alAO. => alPA 






Figure 2-3 
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and list the reasons step by step. If one asks for the rule used in a step, the 
program can give details of the rule (Figure 2-3). 



Problem — E:\Does\LTJH\XT\paper.ljh 



Suppose: 

BAD C is a square. AEl plane BCA. AFIBE. F 
is the foot, FG Ice. G is the foot. 

Show that GAi EC. 



Solution: 

BADC is a square (known) 

ABICB (Rule 34) 

AEI plane BCA (known) 

AElCB (Rule 13) 

CBl plane BAE (Rule 20) 

BClPlaneGFA 

(Think about again ! Is the Assertion True?) 

Figure 2-4 



Bi^og Pwe for Alternate Solving 






^ Geometry Informations in the Prot 
Four Points are Coplanar 
1^ Perpendicular Lines 
: Q 00:AFJ_EFB 
■ Q 01:GFJ_EGC 
— 

- Q 03:AE±CE 
I I A Line Perpendicular to a Plane 
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Selected Infomations 






m 



00:AFJ_EFB 

02:AE1a.B 



[— New Infomations 



Informatio] Got 4 of New Messrs 
Conclusion | Hasn't Gel Conclusion 



About Rules 
Next 

Operate ii|||ii||| IIIIIIIII|I||IM I 



Rules can be 



Hypothesis 



Three Points are CoUinear 



ane < 



“ Content of Rule 

Rule 0:(Axiom 1) If two points onaline are ina 
plarre.then all the points on the same line are also in 
this plane. 



Hint for Solvirjg 



|^&.AG±EGC (Rulel4) 

[i & AF±plane FBEGC (Rule20) 

Q AF±EFB ( Known) 
&-CB-LAF (Rule 13) 

|i(-Cn CB±plane AFBE(Rule20) 






' I t rr 



J 



Restart Reasoning by Step Delete Limited Reasoning 



I Computer Searches new Message from Selected Information 



Figure 2-5 
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The procedure of automated reasoning is closely related to the figures. If 
point D is between point A and B, AB = AD + BD, otherwise AB = BD — AD. 
Users can do exercises themselves by using the tools of “Manual Solving” . When 
you give yourself a reasoning step by step, the program will judge each step 
and give comments on your reasons. The program will keep the reason step if it 
thinks it is right (Figure 2-4). 

Alternatively, the user may choose some facts from the information pro- 
vided by computer system as the presupposition. Then the computer system 
will do a step of reasoning based on the facts, until the conclusion become clear 
(Figure 2-5). This is called “Alternate Solving”. 

Alternate solving is easier than “Manual Solving” . The user can choose one or 
more geometrical information from the database including known conditions and 
deduced properties. The computer system will infer more properties or conclu- 
sions from them and give the new information for the user to choose (Figure 2-5). 
When the conclusion is obtained, the computer system will generate a proof by 
the thinking of the user. Users can inquire the detail of every reasoning rule any- 
time (Figure 2-3) and try to remove or add some built-in rules. If you inform the 
computer system that your students are learning some chapter, the computer 
system will use only the rules learned by the students (Figure 2-6). 




Figure 2-6 

FUNCTION 3. Measure and Rotation in Space 

The computer system can find the length of a segment, the degree of an 
angle, the area of a triangle, and the volume of a prism, etc. The ruler could be 
fixed or dynamical. The data measured will change while the graph is changed. 
In the above example, ABCD should be a square. But its four angles don’t seem 
to be right angle and its four sides don’t seem to be equal to each other. Users 
can measure the lengths of AB, BC and its angles to check whether ABCD is 
a square or not. And one can make a rotation to see that ABCD is indeed a 
square at some position (Figure 2-7). The data in the screen will be changed 
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along with points moving. When EA = AB, LGAF is of 30 degree (Figure 2-8). 
Then, we discovered an interesting problem. The computer system will give its 
proof using the function of reasoning (Figure 2-9). 
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FUNCTION 4. Part Copy and Translation of Figure 

Using this function, we can imitate the partition of a solid. For instance, 
we can draw a prism ABC — XY Z first, then copy three tetrahedrons AXYZ, 
ABCZ and ABY Z from this prism and move them to other places using the 
parallel moving function. So that we can clearly explain the truth of “The volume 
of a pyramid is equal to 1/3 of the prism which has with the same bottom and 
equal height with the pyramid” (Figure 2-10). 
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FUNCTION 5. Section of a Solid 

The user can cut a prism, a pyramid, etc. with arbitrary assigned plane. Such 
as the section of cube which is cut by its main diagonals mid-perpendicular-plane 
is a regular hexagon. And we can see its exact shape after rotating the cube to 
an adequate position (Figure 2-11). 

6 Concluding Remarks 

We have seen that machine readable solving in geometry is useful for developing 
the ICAI software. The ICAI software based on automated reasoning can pro- 
vide a micro-world, which represents the knowledge structure of real solids. The 
micro-world has at least three advantages for teaching and learning: 

(1) In the micro- world, the knowledge about the solids is divided into some 
elements and relationships, and these basic parts can also be combined to 
make meaningful structures. User can control and operate on them with both 
graphic translating and symbolical reasoning. So it is a full-scale model of 
real solids. 

(2) The dynamical construction provides an environment for students to study. 
They can change the figure on screen continuously by hand and observe it 
to find geometric conjectures. 

(3) It can be a stage on which teachers can build their knowledge bank by 
saving solved problems. The bank may help teachers to prepare their lessons 
more easily. 

Under the teacher’s instruction, students can do constructing, reasoning, 
computing, measuring and figure translating in the micro-world. With the com- 
puter system, they may ask, answer, guess and create problems. Learning will 
become more interesting. 

But this is just a starting point. There remains a lot of work to do. Such as: 

(1) To develop similar software for physics and chemistry. Problems in these 
courses may be easier than in geometry. But representing knowledge is more 
complex, and there would be more kinds of problems. 

(2) To improve the program such that users can add reasoning rules proved by 
themselves. 

(3) In machine readable solving for geometry, it is faster to use signed area 
and full-angle. But solutions using traditional geometric quantities would be 
more welcomed. An idea is to first reason with any effective tool, and then 
translate the solution into a more traditional style. 

We hope that more work will appear in this direction. 
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Abstract. An automatic reasoning system for plane Euclidean geom- 
etry should handle the wide variety of geometric concepts: points, vec- 
tors, angles, triangles, rectangles, circles, lines, parallelism, perpendicu- 
larity, area, orientation, inside and outside, similitudes, isometries, sine, 
cosine, ... . It should be able to construct and transform geometric ob- 
jects, to compute geometric quantities and to prove geometric theorems. 
It should be able to call upon geometric knowledge transparently when it 
is needed. In this paper a type of ring generated by points and numbers 
is presented which may provide a formal basis for reasoning systems that 
meet these requirements. The claim is that this simple algebraic struc- 
ture embodies all the concepts and properties that are investigated in 
the many different theories of the Euclidean plane. 



A reasoner for plane Euclidean geometry would 

~ know basic geometric facts; 

— know basic trigonometry; 

— handle complex numbers; 

— compute geometric quantities, such as areas of polygons and cosines of an- 
gles; 

— work with diagrams; 

~ interact transparently with reasoners that need geometry; 

— prove geometry theorems. 

The best current computational systems are far from meeting these require- 
ments. On the one hand, while computer algebra systems such as Mathematica 
and Maple, have some capability of handling local aspects of this program, such 
as deriving trigonometric formulae, their reliance on ad hoc rules makes the 
problem of integration of these fragmentary capabilities quite intractable. On 
the other hand, the wonderful geometry theorem provers based on the meth- 
ods of Wu and Buchberger, are dependent upon introduction of coordinates, a 
feature that in its own way is ad hoc^ since we know that coordinates are in 
Fact substantially irrelevant. Though one might hope that Euclid’s original ax- 
iomatic approach would be the way forward, attempts to automate it have had 
scant success. 

In my opinion what is needed for the automation of reasoners with the in- 
dicated capabilities is an expressive, computationally tractable, simple uniform 
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formalism. In this paper a candidate for that role is presented. An algebraic 
structure, called a Euclid ring, is introduced that is generated by points and real 
numbers, that fully supports Euclidean plane geometry, trigonometry, transfor- 
mations and complex numbers, and that is coordinate-free. The vector fragment 
of this structure is the well-known two-dimensional positive-definite Clifford al- 
gebra introduced by W. K. Clifford. To use this algebra, many authors identify 
points with “vectors from the origin”; this is ungeometrical and avoidable, as 
our treatment shows. 

No new geometric results are presented in this paper. Its thrust is rather to 
demonstrate the richness of Euclid ring semantics by showing how concepts such 
as angle and area arise naturally, and to demonstrate the algebraic power of the 
Euclid ring formalism by giving a range of elementary computations and proofs 
that may be obtainable automatically by methods currently under development. 

1 Mathematical Structure 

A Euclid ring is a ring 17, with unit element 1, and subsets TZ and V such that 

1. TZUV generates 17; 

2. TZ is the field of real numbers and is a subring of 17; 

3. 7^ is a 2-dimensional affine space over TZ] 

4. aA = Aa for a € 7Z, A € V; 

5. (B - A)2 > 0 for A yf B € V; 

6 . OiV. 

Elements of V are called points and denoted by Roman capitals A, B, C, 
. . . , Q] note that U, V, W, X, Y and Z do not denote points. Real numbers 
are denoted by Greek letters, a, /3, 7 , . . . . The number 0 is the additive identity 
of the ring and the number 1 is its multiplicative identity. V is closed under 
affine combinations (meaning that A,B £ V, a + j3 = 1 ^ aA Y f3B G V) 
and is generated by three linearly independent points. Indeed, thanks to items 1 
and 3, 17 itself is generated by the real numbers and three linearly independent 
points. 

All elements of a Euclid ring are sums of products of numbers and points and 
it is easy to calculate with them; for example, to verify that if P = ^{A + B + C) 
then 

(A - Bf + {B- Cf + {C-Af = i [(A - Pf + {B- Pf + {C - Pf] 

is a matter of high-school algebra (though one must not assume that multipli- 
cation is commutative). 

We will progressively show how various types of geometric objects, includ- 
ing vector and angle, geometric quantities, including length and area, geometrie 
predicates, including perpendicularity and parallelism, and geometric transfor- 
mations, including translation and similitude, are represented in Euclid ring 
terms. Note, however, that, just as in natural language, it is not the case that 
every well- formed term has an interpretation. 
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2 Rules of Interpretation and Physical Facts 

A good way to approach the semantics of a Euclid ring is to view it as providing 
an algebraic theory of a physical domain, the drawing board of the engineer or 
architect. The link between 17 and the physical drawing board requires two rules 
of interpretation', the elements of V are to be interpreted geometrically, as their 
name suggests, as points, and the number {B — A)^ is to be interpreted as the 
square of the distance (in some fixed unit) between the points A and B. We shall 
write \B — A\ = ^J{B — A)^. 



D C 




Fig. 1. Facts 1, 2 and 3. 




The interpretation of other algebraic entities will, as we shall see, be forced 
by these rules of interpretation and by certain fundamental physical facts like 
(see Figure 1) the following: 

Fact 1. A point P lies on the line segment between points A and B if and only 
if the distance from A to S is the same as the sum of the distances from A 
to P and from P to B', 

Fact 2. Points A, B, C and D are successive vertices of a parallelogram if and 
only if the point mid-way between A and C is the same as the point mid-way 
between B and D', 

Fact 3. A parallelogram with vertices A, B, C and Z? is a rectangle if and only 
if the distance from A to C is the same as the distance from B to D', 

Fact 4. The area of a square with side-length 1 is 1 . 

It is to be emphasized that these are statements about the physical world of 
the drawing board which we seek to model. Although, for example, a rectangle 
may be defined in various ways, we always find, upon measurement, that Fact 2 
and Fact 3 are confirmed, and in any model we shall certainly insist that the 
corresponding statements, formulated mathematically, be true. 

Fact 1 forces the interpretation of P = aA-|- f3B where a + [3 =1, a>Q and 
/3 > 0; for then we have 



P = aA + (3B P - A = P{B - A) and B - P = a{B - A), 
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and these equations imply that \P — A\ = (3\B — A\ and \B — P\ = a\B — A\ 
and hence that \B — A\ = \B — P| + |P — . If (1) is to hold we must interpret P 

as the point which divides the line segment from ^ to i? in the ratio j3 \ a. (See 
Figure 1.) The interpretation of a general affine combination (for which a+(3 = 1 
but either a or /3 may be negative) now follows easily. In particular, ^{A + B) 
is the mid-point of the line segment between A and B. 




We now find that even such a simple algebraic identity as 



i(A + C) + i(i? + Z?) 

]^{A + D) + \{B + C) 

has a geometric interpretation (see Figure 2) which, though obvious, is a non- 
trivial fact. Having shown that P = ^{A-\- B C) is the centroid of the triangle 
with vertices A, B and C we see that the algebraic identity stated in Section 1 
is a metric theorem which can be checked experimentally by making careful 
drawings and measurements. Each interpretable theorem of our algebra affords 
a further test of the efficacy of our model — if ever such a theorem, in its physical 
interpretation, fails to hold, we must reject the model. Every model is tentative, 
even Euclid’s. 



1 
2 

_ 1 
“ 2 



^-(A + B) + ^{C + D) 



3 Vectors 

Fact 2 (see Figure 1), together with the equivalence 

B-A = C-D^^{A + C) = ^{B + D) 

forces us to interpret the difference of two points B — A as the vector from A 
to B. We will use letters U, E, IF, X, ... from the end of the alphabet to denote 
vectors. 
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If P is a point then IP = P, a point, but no other real multiple of P 
can be a point; for if aP, with a yf 1, was a point Q, we would have 0 = 
a/(a — 1)P — l/(a — 1)Q G V. We do not need to interpret aP for a different 
from 0 and 1. Also a vector A — B cannot be a point P, since that would entail 
that = i(P + P) G P, contrary to the above. 

The equivalence 

B = A + X^X = B-A 

shows what it means to add a vector to a point. The identity 

C-D={A + C-D)-A 

shows that any point may be chosen as the starting point of a given vector. The 
identity 

(C-A) = (C-P) + (P-A) 

shows that the set of all vectors is closed under addition and also shows how 
addition of vectors is to be interpreted. The (usual) interpretation of multipli- 
cation of vectors by real numbers and closure of the set of vectors under this 
operation are easy to establish. Hence the set of all vectors is a linear space; it 
is two-dimensional, having as basis the pair of vectors B — A and C — A where 
A, B and C are linearly independent points. Note, though, that some of what 
follows is independent of dimension. 




Example. A familiar classical affine theorem (see Figure 3) is expressed by the 
implication: 

D = aA + j3B, E = aA + j3C ^ D - E = (3{B -C). □ 



Example. As an example of a modern affine theorem (involving the notion of 
vector in an essential way) we invite the reader to interpret and generalize the 
identity: 
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-(Ai + A2) - 2(^6 + ^1) 



-(^3 + A4) 
-{A5 + Aq) 



-(^2 + ^3) 

-(^4 + ^5) 



= 0 . 



□ 



The square of the vector X from A to S is the positive real number = 
(B — Ay, interpreted, of course, as the square of its length, and we shall write 
|X| = yjO. 



4 Perpendicularity 

From an algebraic point of view the Fact that 

{X + Yf = {X- Yf ^YX = -XY ^{X + Yf = 

is trivial, but its geometric interpretation is subtle and important. (It will be 
made completely clear when, in a moment, we show how a product XY is to 
be interpreted.) The empirical Fact 3 (see Figure 1) forces us to define per- 
pendicularity of vectors X and Y in such a way that X 1 .Y if and only if 
{X + Yy = {X — YY', or, equivalently, via 

X±Y ^YX = -XY. 

Then one of the above equivalences is the theorem of Pythagoras. 




Fig. 4. An isosceles triangle theorem. 



Example. The theorems suggested by Figures 4 and 5 are proved as follows: 

X^ = Y^ ^ {X + Y){X -Y) = -{X - Y){X + Y) 

^ X + Y A X - Y. □ 
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Example. (See Figure 6.) The ring identity 

{B - C){A - P)+ {C - A){B -P) + {A- B){C - P) 

= -{A - P){B -C)-{B- P){C -A)-{C- P){A - B) 

implies that the altitudes of a triangle intersect; for it shows that if B — CPA — P 
(that is, if {B — C){A — P) = —{A — P){B — C)) and C — APB — P then 
A — BPC — P. It is easy to devise a similar proof that the perpendicular bisectors 
of the sides intersect. Theorems arising in this way from identities may be said 
to be embodied in a Euclid ring. So far as I know the problem of characterising 
those theorems that are embodied is open. □ 



A 




A vector Y with 
vector X is 



1 is called a direction. The direction of a non-zero 



Example. (See Figure 7). The line with direction Y through a point B = A-\-X 
on the circle with centre A and radius |A| consists of all points P = A-\- X -\-tY 
with r S R. The condition for P to lie on the circle is that {X + or, 

equivalently, that 

T = 0or T = -{XY + YX)- 
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hence there is a unique intersection point if and only if Y^-X. Observe that 
XY + YX is a real number, since it equals {X + Y)^ — X^ — Y^. □ 




5 Parallelism 

Vectors X and Y are parallel, and we write V||V, if one is a multiple of the other. 
It is clear that if X\\Y then XY = YX. Let us prove the converse. Assuming 
without loss of generality that X and Y have equal lengths, we have 

XY = YX ^ {X - Y){X + V) = 0 
^ V = ±A 
^F||A. 

We have here used the Fact that any non-zero vector V is invertible (with 
inverse V~^ = and hence 



VW = 0 4^V = 0 or W = 0. 



6 Angles 

Whatever definition one adopts for the (oriented) angle X, Y between direc- 
tions X and Y one would like it to be such that whenever X,Y = X' ,Y' one 
has (as Figure 8 suggests) 



X-Y' II Y -X', 
X + Y' \\ X' + Y, 
X + Y' LY - X' 
and X' YY LX - Y'-, 
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equivalently, the following four equations should hold: 

XY - Y'Y - XX' + Y'X' = YX- YY' - X' X + Y'Y' 
XX' + XY + Y'X' + Y'Y = X'X + Y'Y' + YX + YY' 
XY + Y'Y - XX' - Y'X' = -YX - YY' + X'X + Y'Y' 
XY - Y'Y + XX' - Y'X' = -YX + YY' - X'X + X'Y' 




Adding, we see that these equations imply that 



XY = X'Y'. 



Accordingly, we define the angle X, Y between directions X and Y to be the 
product XY. 

As it is sometimes convenient to talk about the angle X, Y between non-zero 
vectors X and Y we define this to be the angle between their directions: 

|A||r|- 

Note that it is only for directions X and Y that the angle A, Y is the product 
AF. 

Example. (See Figure 9.) Let A^ = 1 = A^, F yf 0. Then 

A^F = X^ ^ XY = AiF 
^ A = Ai 
^ Ai||A. 

Here we have multiplied both sides of one equation by Y~^ . □ 

If A, F and Z are directions then 

(AF)(FZ) = XY^Z = XZ. 
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Note that an arbitrary angle VW between directions V and W has the form 
Y Z for a suitable Z, namely Z = YVW ; it will be shown in a moment that Z 
is a direction. Assuming this, we see that the set of all angles is closed under 
multiplication. Since 1 = is an angle and 

XYYX = 1 = YXXY, 

angles form a group under multiplication, with {XY)~^ = YX. In particular 
XY determines YX. 




Example. (See Figure 10.) 

= Xl, Y^ = Y^, XY = AiYi ^ {X - Yf = (Ai - Yif . 

This is a familiar result about congruence of triangles. The theory of congruence 
may be developed without introducing transformations and the same applies to 
similarity. □ 

We call 1 = XX the identity angle, —1 = —XX the straight angle, the 
complement of XY and —YX its supplement. Then, for example (see Figure 11) 



XYY{-X) = X{-X) = -1. 
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Readers may like to interpret the identity 

(XiX2)(X2X3)...(X„_iX„)(X„Xi) = 1 
where the Xj are directions. 

The geometric meaning of the conditions for perpendicularity and parallelism 
now become clear (see Figure 12, (a) and (b)): 

x±Y ^ xy^ = 
x\\Y-^xy = fy. 









-X X 
(a) 




/V 




(c) y' 



Fig. 12. Right angles and straight angles. 



Let us now prove that if X, Y and Z are vectors then so is XYZ. This holds 
trivially if X and Y are linearly dependent, and otherwise it holds for Z = X 
by virtue of the identity 

XYX = [{X + Yf - X - y 

and for an arbitrary Z = aX + PY because 

XYZ = aXYX + PXY^. 

Moreover if X and Y are directions then 

{XYZf = a'^ + aP{XY + YX) + 0^ = Z^- 

in particular, (a result that was needed above) if Z is also a direction then so is 
XYZ. This argument also entails that the group of angles is Abelian, since for 
directions X, Y and Z we have 



XYZ XYZ = 1 
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and hence 

ZXYZ = YX = YZZX. 

If X and U are arbitrary directions one easily checks that the vector U—XUX 
is perpendicular to X; hence there exists a direction Y perpendicular to X. 

7 Orientation 

Let X and y be a pair of perpendicular directions. If V and W are any other 
such pair, then XY = —YX and VW = —WV and so 

{XYf = -I = {VWf. 

We know that VW = XZ for a suitable direction Z, and hence that 

(XYf - (XZf = 0; 

because multiplication of angles is commutative this is equivalent to 

(XY - XZ)(XY + XZ) = 0 

or, indeed, to 

X(Y - Z)X(Y + Z) = Q. 

It follows immediately that either Z = Y ox Z = —Y and hence that either 
VW = XY or VW = X(-Y) = -XY. 

When we model a line as a one-dimensional affine space a choice of orientation 
(left and right) must be made. The situation in two dimensions is similar. Write 
i = XY. Then i and —i are called right angles, and one of them, say, i, is called 
positive and the other negative. 

Note that 

VW = ±i^ (VW)"^ = i^ = -l^ VW = -WV v±w. 

Multiplication of positive and negative right angles has the expected proper- 
ties (see Figure 12 (c)): 

= (-z)2 = -1, 

= (- 0 * = 1 : 

(- 1)2 = 1 . 

For any direction V the unique direction W such that VW is a positive right 
angle isVi: 

VW = i^w = Vi. 

Hence i anti-commutes with vectors: 

Vi = V(VVi) = -V(ViV) = -iV. 

The directions V and Vi are linearly independent (or not parallel) since 
VVi = ViV ^ ^ _y2- ^ y = 0. 

Hence they form a basis for the space of all vectors. 
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8 Trigonometry 

Consider the angle VW between directions V and W. Since (see Figure 13) 

W = aV + pVi ^ VW = a + (3i 

we see that VW is uniquely expressible as a linear combination of the angles 1 
and i. We define the cosine and sine of the angle VW = a + /3i to be the real 
numbers 

cos VW = a and sin VW = [3. 

Despite appearances these definitions are genuinely coordinate-free. 



Since also WV = a — /3i we have 

1 = VWWV = (a + /3i)(a - /3i) = = cos^ VW + sin^ VW. 

Hence — 1 < cos VW < 1 and — 1 < sin VW < 1; the first of these is the 
Cauchy- Schwartz inequality. Of course, cos 1 = 1, sin 1 = 0, cos i = 0, 
sin i = 1, cos(— 1) = —1, sin(— 1) = 0, cos(— i) = 0 and sin(— i) = —1. 

Also, for example (see Figure 13), 



The “addition formulae” are obtained by taking real and imaginary parts in 
a product 




Fig. 13. Sine and cosine of the angle VW. 



cos W{Vi) = cos(a — (3i)i = cos(/3 -I- ai) = sin VW. 



{VW){XY) = {a + (3i){-f + Si) 

= (ay — f3S) (a6 j3^)i 



to get 



cos{VW){XY) = cos VW cos XY - sin VW sin XY 



and 



sm{VW){XY) = cos VW sin XY -|- sin VW cos XY. 
If X and Y are arbitrary non-zero vectors, we have 




and 
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Fig. 14. The cosine and sine rules. 



Immediate consequences are the cosine rule (see Figure 14 (a)), 

{X + Yf = + Y^ + XY + YX 

= X^ + Y^ -2 |X| |F| cos Xy 

and the sine rule (see Figure 14 (b)): if X + Y + Z = 0, then 



XY -YX = YZ - ZY = ZX - XZ 



and hence ^ 

sin X, —Y sin Y, —Z sin Z, —X 

1^1 " 1^1 " m ■ 

The cosine rule and the Cauchy-Schwartz inequality together entail the tri- 
angle inequality, 

\X + Y\<\X\ + \Y\. 



An angle is called acute if its cosine is positive and obtuse otherwise; and it is 
called positive if its sine is positive and negative otherwise. It is also convenient 
to call the sign (1 or 0 or —1) of sin A, Y the orientation of X,Y. 

It is now easy to prove, for example, that at least two angles of any triangle 
are acute and to prove that the product of two acute positive angles is positive. 
Also one may relate the present definitions to properties of triangles by showing 
that for a triangle {A,B,C) with a right angle at B the cosine of the angle 
between the vectors B — A and C — A is while its sine has absolute 

value }§ 5 ||- 



9 Complex Numbers 

Let U, V, U\ and Vi be directions, with ^[/V = tjUiVi, where ^ > 0 and 77 > 0 
are real numbers. We will prove that ^ = rj and (hence) UV = UiVi. Indeed, we 
may write UV' = UiVi for a suitable direction V' and we have 

^UV = T]UV' = r]V' 
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here we first multiplied on the left by U and then squared. 

Any positive multiple ^UV of an angle may also be regarded as a ratio of 
vectors conversely, any ratio of vectors X~^Y (where A 0) is a 

positive multiple of an angle: 



|A|^|A||y| 



i- 



Clearly X does not determine either X or Y but, by what has just been 
shown, it does determine both the angle X,Y and the ratio |^. Equivalently, 



X~^Y = Af ^Yi A, r = Ai, Yi 




M 
1^1 1’ 



Thus X~^Y = Aj“^Yi if and only if the triangles with edge vectors A 
and y, Ai and Yi are directly similar. 

As we shall show in a moment, it is reasonable to call ratios of vectors X~^Y 
complex numbers. (Note that X~^Y = 0 <t4> E = 0.) The number is the 

absolute value of X~^Y and X,Y is its angle. The complex number also 

determines another complex number YX~^, its conjugate, given by 



YX~^ 




since y, A = (A, y) 

Real numbers are ratios of parallel vectors: 



r||A y = aA where a&TZ^ X~^Y = a€'Jl. 



Imaginary numbers are ratios of perpendicular vectors: 

y±A y = /3Ai where (3&n^ A-^y = € Tli. 



And complex numbers of absolute value 1 are ratios of vectors of equal length, 
or what we have been calling angles: 

1^1 = 1^1 ^ = 1 ^ X-^Y = 

It follows immediately from the corresponding facts for angles that every 
complex number may be written uniquely in the form A“^y, where A is a given 
non-zero vector, and that the non-zero complex numbers form an Abelian group 
under multiplication. (See Figure 15.) Also, since 

X~^Y + X~^Z = X~\Y + Z), 

the complex numbers form an Abelian group under addition. Thus the set C of 
all ratios of vectors is a field. 
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For X ^ 0 we have 

Y = aX + pXi ^ X~^Y = a + ^i; 

because X and xi are linearly independent, the pair of complex numbers, 1 
and i, form a basis for C as a linear space over TZ. The coordinates, a and j3, are 
the so-called real and imaginary parts, and 3[C], of the complex number 
(^ = X-^Y. 

Observe that 

rX-i = a-/3i 

and 

\X-^Yf = ^ = X-^YYX~^ = a^ + 

Since these properties characterize the complex field, our terminology is appro- 
priate. 

Conjugation, X~^Y i-^ YX~^, is an automorphism of C that carries i to —i. 



10 Area 

If we define the (oriented) scale- free area of a polygon {A\,A 2 , . . . , A}j) to be the 
quantity 



m[Ai, A 2 , . . . , Aff\ A\ A A 2 A 2 A A^ -I- ■ ■ ■ -I- A^ A Ai 

where AAB := ^{AB—BA), then we get a full theory of plane area, as expounded 
in [4] . The main properties are that 

— area is additive; 

— area is translation-invariant; 

— area of triangles is characterised, up to choice of scale, by three basic theo- 
rems of Euclid. 
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Note that area is vectorial (expressible in terms of vectors only), since 
m[A, B, C] ={B-A)A{C- A); 



this is one point where the value of the distinction we make between points and 
vectors is manifest. 

The space of scale-free areas of polygons in the affine plane is one- 
dimensional. What is new in the Euclidean context is that there is a natu- 
ral choice of scale. Choose a point O and direction X. Let Y = Xi, 0\ = 
0 + X ,02 = 0 + Y. Then: 

m[0,0i,02] =XAY= ^{XY-YX) = XY = i 

and m[0, Oi,Oi + O 2 , 02 ] = 2i. According to Fact 4 (of Section 2), the area of 
the unit square should be 1. We therefore define the (oriented, scaled) area of a 
polygon (Ai, A 2 , . . . , Ak) to be the real number 

fi[Ai,A 2 , . . . , Afc] := A 2 , . . . , Afc] 



For the area of a triangle {A, B, C) with V = B — A, W = C — A we have a 
familiar formula: 



^Ji[A,B,C] =^V AW 

1 VW - wv 

= i|F|.|W|sin 

= ibase[A, B, C].height[A, i?, C]. 

An arbitrary square with vertices Ai, A 2 = Ai -I- AT, A 3 = Ai+X+Xi, A 4 = 
Ai -I- Aiz has area 



^[Ai, A2, A3, A4] — ^[Ai, A2, A3] -|- /r[Ai, A3, A4] 

= i 3[A(A + Xi)] + i 3[(A + Xz)(-X)] 
= 

= A^. 



11 AfRne Maps 

Denote by V the set of all points, by £ the (real) linear space spanned by V, 
and by V the set of all vectors (a proper linear subspace of £). 
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A function f : V ^ V called an ajfine map if it preserves lines and ratios 
of distances along lines, or, equivalently, if 

f[aA + pB] = af[A]+/3f[B] 

for all A,BgV and a, /3 € 7?, with a + P = 1. (Actually it is only necessary to 
assume this for a and /3 both positive.) 

An affine map preserves parallelism of lines since {A, B, C, D) is a parallelo- 
gram if and only if ^(A + C) = ^{B + D). 

Any affine map f:V V can be extended naturally to a linear function 
f : C ^ C. To achieve this one must set 

/[oA] = a/[A] for all a G TZ, A £ V, 

and 

f[B - A] = f[B] - f[A] for all A,BgV. 

Since every element of C can be shown to be either a unique multiple of a 
unique point, aA, or a vector B — A, and since 

B - A = C - D => UA + C) = ^{B + D) 

^lif[A] + f[C]) = ^if[B] + f[D]) 

^ f[B]-f[A] = f[C]-f[Di 

this determines / on £ in a well-defined fashion. Now for arbitrary points A 
and B and real numbers a and /3, one has, in the case where a + P ^ 0, 



f[aA + PB] 



and, in the case where a + P 



af[A]+Pf[B], 

0 , 



P 



ct P 

P 



B)] 



ex P 



B] 



f[aA + PB] = /[((a + 1)A + pB) - A] 

= f[(a + l)A + pB]-f[A] 

= (a+l)f[A]+pf[B]-f[A] 
= anA]+pf[B]. 



From this it follows easily that / is linear on C. In future we use the term 
ajfine map for the unique extended map, / : C ^ C, and we call the restriction 
f : V V (which is also linear) its vector part. 



Example. For every vector W there is a translation t : V ^ V given by 



t[A] = A + W for every AgV. 
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This is an affine map since for a + /3 = 1 we have 

t[aA + ^B] = aA + PB + W = a{A + W)+ P{B + W) = at[A] + pt[B] 

and it leaves every vector X = B — A fixed, since 

t[X] = t[B] - t[A] = {B + W)-{A + W) = X. 

In a natural way the translations form a linear space isomorphic to V. □ 
Let f :V ^ V he affine. Since / 

f[A + X] = f [A] + f[X], 

is determined by its vector part together with its action on a single point A. 
Moreover, writing 



f[A + X] = {A + f[X]) + {f[A]-A) 

= (t o g)(A + X), 

we see that / is the composite with a translation / (by f[A] — A) of a map g, 
given by 

g[A + X]= A + f[X] for X € V, 
which is affine, since for a + /3 = 1 and C,D G V, we have 

g[aC + PD] = g[A + a{C - A) + P{D - A)] 

= A + f[a{C-A) + p{D-A)] 

= A + af[C-A] + pf[D-A] 

= a{A + f[C — A]) + P(A + f[D — A]) 

= ag[C] + Pg[D], 

which has a fixed point, since 

5 [A] = A, 

and which has the same vector part as /, since for X G V 

g[X] = g[A + X] - g[A] =g[A + X]-A = f[X]. 

This effectively reduces the analysis of affine maps to the consideration of linear 
maps on vectors. 

12 Similitudes 

A function f : V V is a similitude if it preserves ratios of distances, or, 
equivalently, if there exists A > 0 such that 



{f[B]-f[A]f = \\B-Af 
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for all points A and B. A similitude for which A = 1 is called an isometry^ thus 
isometries preserve distances. Obviously translations are isometries. 

Similitudes preserve lines, since a point C lies on the line segment between A 
and B if and only if 

\B-A\ = \B-C\ + \C-A\- 

Thus similitudes are affine maps. A similitude / stretches each vector X = B — A 
by the factor A : 



f[Xf = {f[B] - f[A]f = \\B - A)2 = A^A^. 



Example. There is a natural action of angles on vectors: for each angle YZ^ 
where Y and Z are directions, define / : V ^ V by 

/[A] = AFZ 

for every X G V. Clearly / is linear. The affine map whose vector part is such 
an / and which has a fixed point A is called the rotation about A through the 
angle YZ. Rotations are isometries since, using the Fact that angles commute, 
we have 



(XYZf = (XY)(ZX)(YZ) = (XY)(YZ)(ZX) = X^. 



In a natural way the rotations about A form a group which is isomorphic 
with the group of angles. □ 

Let / be a similitude, as above. For arbitrary vectors A and Y we have 



f[xr + f[X]f[Y] + f[Y]f[X] + f[Yr = (/[A] + f[Y]r 

= X^{X + Yf 

= X^(X^ + XY + YX + A^), 



and hence 



cos /[A], /[A] 



/[A] /[A] + / [A] /[A] 

2|/WII/[A]| 



= cos A, A. 



It follows that / preserves perpendicularity of vectors. Since, also / 



sin^ /[A], /[A] = 1-cos^ /[A], /[A] = sin^ A, A, 



multiplies areas (i |A| |A| 



sin A, A 



) by the factor A^. However, / may reverse 



orientations. 

Let A and A be directions with A A = i, a positive right angle. From the 
above we know that 



/[A] /[A] + / [A] /[A] = A^(AA + AA) 
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and either 

f[X]f[Y] - f[Y]f[X] = X\XY - YX) 
or 

f[X]f[Y] - f[Y]f[X] = X\YX - XY). 
In the first case we must have 



f[X]f[Y] = X^XY, 

and hence 

f[5^[Y] = 

while in the second case we have 

f[X]f[Y] = X^YX 

and hence 

/[x]7[y] = y;^. 

In the first case, / preserves all angles and we have what is called a direct 
similitude', for every angle has the form XZ for some direction Z = otX + ^Y , 
and 



f[X]f[Z] = f[X]{af[X]+Pf[Y]) 
= aX'^X'^ + PX^XY 
= X^XZ. 



Similarly, in the second case / reverses all angles and we have what is called 
an indirect similitude. 

Because 



x-y = jllyy-. 



it must be the case that / carries triangles to similar triangles, preserving ori- 
entation if direct and reversing orientation if indirect. 

Let ^ be a point. Define g V ^ V hy 



g[B]=A+j{f[B]-f[A]), 



for every point B. Clearly g is an isometry with fixed point A, and 



f[B]=A + X{g[B]-A) + {f[A]-A) 

= {todog)[B\ 

for every point B, where t is translation by f[A] — A and d : V ^ V is the 
similitude with fixed point A, given by 



d[B] = A + X{B-A) 



for every point B, and called a dilation. This effectively reduces the problem of 
finding all similitudes to that of finding all isometries. 
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13 Isometries 

The action of a direct isometry / on vectors is completely determined by its 
action on a single vector, since the identity 

f[X]f[Y] = XY 

entails that / carries each X e V to 

f[X] = XYf[Y]-^ 

where Y is some fixed non-zero vector. Indeed 

1 Yf[Y] Yf[Y] 

is an angle, and so the direct isometries are identified as precisely the maps 
obtained by composing a translation with a rotation. 

Example. Consider an indirect isometry / which has a fixed non-zero vector Y, 
which we take, without loss of generality, to be a direction. Since /[X]/[y] = 
Y X, the action of / on vectors X is given by 

f[X] = YXflY]-'^ = YXY. 

If X = aY + (3Y i then 

f[X] = aY^ + fiY^iY = aY - j3Yi. 

If / has a fixed point A we call / the reflection in the line through A with 
direction Y. Reflections are idempotent, since 

f[X] = Y^XY^ = X, 

and the product of the reflections in the lines through A with directions Y and Z 
is the square of the rotation about A through the angle Y Z, since 

Z{YXY)Z = X{YZ){YZ). 

The natural induced action of reflections on angles is conjugation: reflection 
in a line with direction Y carries the angle Y X into 

{YYY){YXY) = XY. □ 

Every indirect isometry is determined by its action on a single direction Y, 
since, for every vector X, 

f[X] = YXf[Y]-\ 

and is the composite of a translation, a reflection and a rotation, since 

f[X] = {YXY){Y-^f[Y]-^) = {YXY){Yf[Y]). 

Every rotation is the composite of two reflections. To see this, let / be a 
rotation through the angle Y Z, where Y and Z are directions. If U is the direction 
of i(y -k Z) then YU = UZ, and hence f[X] = XYZ = {XY){UY)U = 
U(YXY)U. 
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14 Conclusion 

The simplest concrete realisation of our formalism is as the ring generated by 
the real numbers and elements O, X and Y, subject to the conditions that real 
numbers commute with the other generators, = 1, = 1 and YX = —XY, 

but otherwise free. It is straightforward to verify the axioms, with V = {0+aX+ 
(3Y : a € TZ, P € TZ}. Of course many elements have no geometric interpretation, 
but our exposition shows that this is not a problem. We have shown how standard 
notions of Euclidean plane geometry are realised, and indeed other notions, not 
mentioned above, such as line vectors, are also realised. 

Note that direct use of such a concrete realisation, rather than our axiomatic 
approach, would allow coordinate-based proof of theorems, including those in- 
volving points. However, a main point of the paper has been to suggest that it is 
necessary — and possible — to move beyond a theorem-proving point of view if 
we are to build more autonomous systems that internalize geometric knowledge 
in a way that will satisfy the desiderata presented in our first paragraph. The 
Euclid ring axiomatization supports direct algebraic reasoning with geometric 
concepts. 

In a Euclid ring, points, vectors and complex numbers are distinct geometric 
entities, with rich interplay between them. Both the identification of points with 
vectors (made in expositions of “vector geometry”) and with complex numbers 
(made in expositions of “complex geometry”) are imgeometrical and obscure 
those relationships; our treatment shows that these identifications are unnec- 
essary. At the same time, the isomorphisms between V and A + V (for any 
“origin” A), and between V and TZ + iTZ, explain why those identifications work, 
to the extent that they do. 

It is not suggested that the approach to the Euclidean plane presented here 
supercedes other approaches. On the contrary, the Euclid ring formalism provides 
a context in which other approaches have their places. I conclude with some 
remarks on connections with other approaches. 

(a) Complex numbers. The Argand diagram was of great importance histor- 
ically because it helped to make complex numbers respectable, and it remains 
the way that we picture complex numbers. Nevertheless I want to suggest that 
this picture is wrong in roughly the same sense that it is wrong to identify two 
structures which are isomorphic but not naturally isomorphic. A drawing board 
(physical plane) has no imaginary axis (or real axis, or origin, for that matter). 
From a geometric point of view, complex numbers are not points but ratios of 
vectors (or positive multiples of angles). Conjugation is not reflection of points in 
the real axis but a map which carries each positive multiple of an angle onto the 
same positive multiple of the inverse angle. Of course if we choose an origin O 
and direction X, every point P is uniquely expressible in the form 

P = 0 + aX + pXi, 

and the map P X~^{P — O) = a -I- /3* is a bijection between points and 
complex numbers, and there is an induced correspondence between reflection in 
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the real axis and conjugation. Reversing the point of view, one may say that a 
major advantage of the Euclid ring approach is that it enables one to deal with 
angles algebraically in the same kind of way that we deal with vectors. 

(c) Coordinates. People who use geometry, such as engineers and architects, 
rarely need explicit coordinates. As with (a), application of Occam’s razor would 
excise coordinates from geometry. Nevertheless coordinates are of interest, since 
to date the most successful geometry theorem provers and systems for the cre- 
ation and manipulation of geometric objects are based on coordinatisation. The 
Euclid ring approach evidently supports resort to coordinates for proofs when 
necessary. At the same time it is reasonable to hope that it may be feasible to 
build geometric reasoning systems that can produce simple algebraic proofs like 
those given in this paper. Current work of D. Wang [9] and others, including a 
group lead by the author, may lead to such systems. 

A point to be noted is that whereas the field of complex numbers, by virtue 
of its algebraic completeness, plays a crucial if somewhat inexplicable role in the 
coordinate-based proof methods for “real” Euclidean geometry, complex num- 
bers appear naturally as actual geometric objects in the treatment presented in 
this paper. 

(c) Synthetic geometry. There is something very appealing about traditional 
synthetic proofs and they may profitably be mixed with algebraic arguments. For 
example after the result (proved here) that the angles at the base of an isosceles 
triangle are equal, one can prove in rapid succession by synthetic arguments a 
sequence of interesting theorems: that the angle subtended by a chord at the 
centre of a circle is twice the angle at the circumference, that opposite angles of 
a cyclic quadrilateral are supplementary, that the product of the distance from 
a point to the points of intersection with a given circle of a line through the 
point is constant. One might even choose to carry this approach to the limit 
by proving as rapidly as possible a set of synthetic axioms (as theorems) and 
thereafter using only synthetic methods. As we have shown, the Euclid ring also 
provides a natural setting in which to introduce transformation geometry. 

(d) Inner and outer products. Of course X i— > |A| is a Euclidean norm on V 
and by setting 

X.Y = iR[XY] = i(|A -h - |A|^ - |F|^) 
we obtain the associated inner product. Moreover by setting 
X AY = i 3[AF] = ^(AE - YX) 

(and X AY A Z = 0) we get an exterior product of vectors. Many of our results 
may be expressed in terms of these products, but this usually amounts to split- 
ting complex numbers into real and imaginary parts, something which is often 
best avoided. 

This paper is a slightly updated version of [3] . It aspires to bring to bear upon 
Euclidean geometry with points and numbers as the fundamental objects, the 
power of ideas which were implicit already in the work of Grassmann, Hamilton 
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and Clifford, and have subsequently been extended by many authors (see, for 
example, [5]). The modern theory of Clifford algebras was established by Elie 
Cartan, Hermann Weyl, Claude Chevalley, Marcel Riesz and others, and thor- 
ough expositions may be found, for example, in the books [7], [8] and [6]. The 
dependence of the paper on the work of these authors is acknowledged. 

The material presented in this paper is very simple, even trivial. But that is 
precisely the point: if an automatic reasoning system is to be able to “do” plane 
Euclidean geometry, and to call upon this ability when needed, it should use a 
simple formalism that is amenable to computation and that, at the same time, 
is sufficiently expressive to fully capture the riches of the world that Euclid gave 
us. That is what the Euclid ring approach attempts to do. 
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Abstract. In this paper a method for mechanical theorem proving in ge- 
ometries is proposed. We first discuss how to describe geometric objects 
and geometric relations in 2D and/or 3D Euclidean space with Clifford 
algebraic expression. Then we present some rules to simplify Clifford al- 
gebraic polynomials to the so-called final Clifford algebraic polynomials. 
The key step for proving the theorems is to check if a Clifford algebraic 
expression can be simplified to zero. With the help of introducing coordi- 
nates, we can prove mechanically most of the geometric theorems about 
lines, conics, planes and so on in plane and/or solid geometry. The proofs 
produced by machine with our method are readable and geometrically 
interpretable. Finally, some interesting examples are given. 



1 Introduction 

It has been a hard work left since the time of Euclid to find a mechanical method 
to prove difficult geometric theorems to make learning and teaching geometry 
easier. In history many mathematicians, including Leibniz, Hilbert and others, 
have tried their best in this field. Modern computer technology and science make 
it possible to produce proofs of geometric theorems mechanically. Since nineteen 
fifties, lots of scholars have studied mechanical theorem proving in geometries, 
but they did not make much progress in proving relatively difficult geometric 
theorems until nineteen seventies. 

In 1978, a breakthrough came with the work of Wu [6], who introduced an 
algebraic method which, for the first time, can be used to prove hundreds of 
geometric theorems mechanically. With Wu’s method, many difficult theorems 
whose traditional proofs need an enormous amount of human intelligence can be 
proved on computer within seconds. 

Motivated by Wu’s method, Chou, Gao and Zhang developed an area method 
that can produce short and readable proofs for hundreds of geometric statements 
in plane and/or solid geometry [1]. This is another great progress in this field. 

In 1997, combining Clifford algebraic expression with Wu’s method, Li and 
Cheng proposed a complete method for mechanical theorem proving in plane 
geometry; this brought people new enlightenment [-5]. Soon after this, Fevre 
and Wang gave a method for mechanical theorem proving based on rewriting 
rules for Clifford algebraic expressions [2] . Recently, Boy de la Tour and others 
have proposed a rewriting system for simplifying Clifford algebraic expressions in 
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3D and investigated its theoretical properties [3] . Having some similarity to their 
work, ours presented here emphasizes various applications to reasoning problems 
in plan and solid geometry. 

In the practice of theorem proving, the first task is to represent geometric 
objects and relations with Clifford algebraic expressions. Then simplifying these 
expressions to simpler forms with certain rules. To prove a theorem, one needs 
to simplify the conclusion expression with the condition expressions and the 
simplifying rules to see if the final expression is zero. With this method we 
have proved, besides theorems about points, lines, planes and so on in plane 
and solid geometry, many theorems about conics which were never considered 
in mechanical theorem proving before. 

In addition, by changing the metric, this method can be used to prove theo- 
rems in non-Euclidean geometry. 

2 Preliminaries 

Let P be a vector space over the field R of real numbers and Q : P — > i? a 
quadratic form. The quotient of the tensor algebra ®(F) by the two-sided ideal 
generated by the elements of the form x®x — Q{x), x G P, is called the Clifford 
algebra of (V,Q), denoted by G(V,Q) or GiY). The product in G{V), denoted 
by juxtaposition, is called the geometric product. Induced by 0(C), G(V) is a 
graded associative algebra. 

Let 0 fy H G G{V). A is called a r-blade if A = a\a 2 ■ ■ ■ where 
ai G F, aittj + ajtti = 0, Vf fy j. 

The symbols Ar,Bs, ■ ■ ■ , represent a r-blade A, a s-blade B, and so on. All r- 
blades generate linearly a vector subspace GriV) of G{V), whose elements are 
called r-vectors. We have G{V) = where Go{V) = R, Gi{V) = V. 

So 

OO 

VA€G(V), A = ^(A)„ 

i=0 

where (A)i G Gi(V) is the i- vector component. 

When r, s fy 0, (ArBgjir-sl is called the inner product of Ar,Bs, denoted 
by Ar ■ Bs, {ArBs)r+s is called the outer product of A^, Bs, denoted by Ar A B s', 
otherwise we define 

Aq ■ Bg = Bg ■ Aq = 0, Aq a Bg = Bg A Aq = A^Bg. 

The definitions can be extended linearly to G{V) x G(V). 

The commutator product of A, B a G{V) is A x B = ~ 

The reverse of A G G{V) is denoted by Al; in which 1 is a linear operator 
satisfying A|. = a^ar-i ■ . . cq for any A^ = a\a 2 ■ ■ ■ dr, where G V . 

The scalar product of A, B G G{V) is A * B = {AB)q, the magnitude of 
A G G{V) is defined as |A| = \/\Ai * A|. 

See [4] for details. 
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3 Plane Geometry 

In this section, we consider the Clifford algebra in the Euclidean plane E^. The 
oriented segment from point A to point S, denoted by AB, is called a vector, 
its magnitude is denoted by \AB\. We denote by I2 the unit 2- vector, and hence 
Gih) = G{E'^), Gi{l2) = E"^. Denote = cos6 + I2 sin0, V0 € R. 

Let line{A,a) denote the line passing through the point A and parallel to 
the vector a; circlel(0, P) denote the circle with the point O as its center 
and passing through the point P; circle2(0, r) denote the circle with center O 
and radius r; parabola{0, F) denote the parabola with vertex O and focus F; 
ellipse{0, F, a) denote the ellipse with center O, focus F and major semi-axis a; 
hyperholaiO, F, a) denote the hyperbola with center O, focus F and real semi- 
axis a; 



3.1 Description of the Method 

In this section, we restrict ourselves to constructive statements in plane geometry. 

By a geometric object we mean a point, a scalar, a vector, a multi- 
vector, . . . , in the plane. A construction means a way to introduce some new 
geometric objects based on some geometric objects introduced in advance. 

Generally, a geometric statement is composed of a sequence of constructions 
and a conclusion that can be written as a Clifford algebraic expression of some 
geometric objects. 

Definition 1 . A construction is one of the following operations: 

Cq) (point(X)) Take an arbitrary point X on the plane. 

Cl) (onJine(X,A,a,t)) Take an arbitrary point X online{A,a). 

C2) (on-cirl(X,0,P,9)) Take an arbitrary point X on circlel{0, P). 

C 3 ) (interJineJine(X,A,a,B,b)) Point X is the intersection of line{A,a) 
and line{B, b). 

C 4 ) (interJine-cirl(X,A,a,0)) Point X is the intersection ofline{A,a) and 
circlel{0, A) other than point A. 

C4) (inter Jine-cir2(X, A, a, O, r, X' ), where X' is optional) Points X and X' 
are the intersections ofline{A,a) and circle2{0,r). 

C 5 ) (inter _cir2-cir2(X , 0\,ri, O2, ^ 2 , X' ), where X' is optional) Points X and 
X' are the intersections of circle2{Oi,ri) and circle2(02,r2) ■ 

Cq) (on-para(X,0,F,p,t) where p = \OF\) Take an arbitrary point X on 
parabola{0, F). 

C 7 ) (paraJ,an(r,0,F,p,P), where p = \OF\) Vector r is the tangent vector 
at point P to parabola{0, F), where point P is on the parabola. 

Cg) (tan_para(X,0, F,p,a), where p = \OF\) Point X is on parabola{0, F), 
whose tangent vector is a. 

Cg) (inter_paraJ.ine(X,0,F,p,B,a), where p = \OF\) Point X is the in- 
tersection of line{B,a) and parabola{0,F) other than point B, where 
point B is on the parabola. 



114 



Haiquan Yang et al. 



Cio) (inter_paraJinepassfocus(X,0, F,p,a, X' ), where p = \OF\, X' is op- 
tional) Points X and X' are the intersections ofline{F,a) and parabola 
(0,F). 

Cii) (on-elli(X,0,F,a,b,c,6), where c= \OF\, b= -\/a^ — c^) Take an arbi- 
trary point X on ellipse{0 , F, a) . 

C12) (elli-tan(T,0, F, a,b, c, P), where c = \OF\, b = — (?) Vector r is 

the tangent vector at point P to ellipse{0 , F, a) , where point P is on the 
ellipse. 

C'la) (tan_elli(X,0,F,a,b,c,cr,X' ), where c = \OF\, b = X' is op- 

tional) Points X and X' are on ellipse{0 , F, a) , whose tangent vector is 
a. 

C14) (inter_elliJine(X,0, F, a,b, c, P,a), where c = \OF\, b = y/ a^ — (?) 
Point X is the intersection of line{P, a) and ellipse{0 , F, a) other than 
point P, where point P is on the ellipse. 

C15) (on-hyper(X,0, F,a,b,c,t), where c = \OF\, b = yj (? — a?) Take an ar- 
bitrary point X on hyperbola(0, F,a). 

C'le) (hyper_tan(r,0, F,a,b,c, P), where c= \OF\, b= y/ 1 ? — 0 ?) Vector t is 
the tangent vector at point P to hyperbola{0 , F, a) , where point P is on 
the hyperbola. 

Cir) (tan_hyper(X,0, F,a,b,c, a, X' ), where c = \OF\, b = y/ (? — a^ , X' is 
optional) Points X and X' are on hyperbola{0,F,a), whose tangent 
vector is a. 

Cis) (inter_hyperJine(X,0,F,a,b,c,P,a), where c = \OF\, b = y/ of — c^) 
Point X is the intersection of line{P,a) and hyperbola{0, F, a) other 
than point P, where point P is on the hyperbola. 

The points X , X' or the vector r in each of the above constructions are 
said to be introduced by that construction. 

Definition 2. A constructive statement is represented by a list 

S (ci , C2 ; ■ • ■ j , G ) J 

where 

— Ci, i = 1 , . . . , fc, are constructions such that the geometric objects introduced 
by each Ci must be different from the geometric objects introduced by Cj,j = 
1 , . . . , z — 1, and the other geometric objects occurring in Ci must be introduced 
before; and 

— G = (i?i,i?2) where E\ and E 2 are the Clifford algebraic expressions of 
some geometric objects introduced by the constructions Ci and Ei = E 2 is 
the conclusion of S. 

By the propositions in [8], the geometric objects introduced by the con- 
structions can be represented by a Clifford algebraic expression of the known 
geometric objects. For testing whether a constructive statement is true or not, 
we simply, from the last construction Ck to the first one Ci, substitute the ex- 
pressions of the introduced geometric objects into the conclusion’s expression, 
and then simplify the result. 
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3.2 Algorithm and Implementation of Q{E^) 

A Clifford algebraic expression is called basic if it is a constant symbol denoting 
some real number, a constant symbol denoting some vector, or the 
unit 2- vector l2 - In particular, a basic vector, denoted by a, b,c, . . ., is a constant 
symbol denoting some vector; a basic scalar, denoted by a, /3, A, . . ., is a constant 
symbol denoting some real number. 

Let A, B,C, . . . ,x,y, z denote the Clifford algebraic expressions which are 
composted of basic scalars, basic vectors and I2 through Clifford algebraic op- 
erations. 

Among Clifford algebraic operations, the most useful one is the computation 
of the grade of a Clifford algebraic element. So we have the following algorithm. 

Algorithm 1. Compute the grade of x if x is a homogeneous multivector, oth- 
erwise returns an error message. 

Cgrade : =proc (x) 

if a; is a basic vector then RETURN (1) 
elifx = /2 then RETURN (2) 

elif x = A-B then RETURN(|Cgrade (A) —Cgrade (S) |) 
elif x = AAB then RETURN (Cgrade (A) -TCgradeCi?)) 
elif x = A-\-B then 

if Cgrade (A) =Cgrade(B) then RETURN (Cgrade (A) ) 
else ERROR 
fi 

elif X = \A then RETURN (Cgrade (A) ) 
elif X = A~^ then RETURN (Cgrade (A) ) 
elif X = AB then ERROR 
else RETURN (0) 
fi 
end 

where A, B are not scalars. 

We consider the inner and outer products as the basic products and will 
express other products with them. So we have the following algorithms. 

Algorithm 2. Compute the inverse of x if it exists, otherwise return an error 
message. 

Inverse ; =proc (.x) 

if Cgradeta;) = 0 then RETURN (—) 

X 

elif Cgrade(a;) = 1 then RETURN ( ) 

X ■ X 

I9 

elif Cgrade(a;) = 2 then RETURN ( ) 

X ■ I2 

else ERROR 
fi 
end. 
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Clearly, the algorithm Inverse cannot deal with all reversible elements, but 
it is sufficient for our purpose. 

Algorithm 3. Represent a geometrie product with inner and outer products. 
Eval^ ; =proc (x, y) 

if a; = X]i=i y = (r > 1 or s > 1 ) 

then RETURN (E[=i Ei=i Eval^(A„ 
elif X = AB then RETURN(Eval_g(Eval_g(A, S), y)) 
elif y = AB then RETURN (Eval_g(x, Eval_g(A, i?))) 
elif Cgrade(a;) = 0 or Cgrade(y) = 0 then RETURN (a; * y) 
elif Cgrade(a;) = 1 then 

if Cgrade(?/) = 1 then RETURN(x • y + a; A y) 
else RETURN(a; • y) 
fi 

else RETURN(a;-y) 
fi 
end 

where a; * y denotes the scalar multiplication, and A, B are not scalars. 

Clearly, if the three conditions at the beginning of the algorithm are not sat- 
isfied, then both x and y must be homogeneous multivectors, and the algorithm 
Cgrade can compute their grades. So the algorithm EvaLg is correct. 

With the above algorithms, we can represent various products that appear 
in our applications with inner and outer products. 

A Clifford algebraic expression is called a Clifford algebraic polynomial if it 
does not contain any inverse of non-scalars and any denominators involving non- 
scalars. What follows will mainly deals with the Clifford algebraic polynomials. 
We will use the following five kinds of rules and formulas 
Rules 1: Both inner and outer product are bilinear. 

Rules 2: For arbitrary scalar X G R and arbitrary multivector A G G{E^): 

XaA = AaX = XA, A-A = A-A = 0. 

Rules 3: For arbitrary r- vector A^ and s- vector Bg -. 

Ar-Bg = • Ar, if r < s. 

ArABg = {-lY’^BgAAr. 

Rules 4: For arbitrary vectors a,b G E'^: 

( 1 ) a A {b ■ I2) = {a ■ b)l2 

(2) a-{b-l2) = {aAb)- 

(3) (a • I2) ■ l2 = -a 

(4) ((a A b) ■ 12)12 = -{a A b) 

(5) l2-l2 = -1 

For any vectors a,b,c,d G E"^: 

a ■ {b A c) = {a ■ b)c — (a ■ c)b 

(a A b) ■ (c A d) = (b ■ c)(a ■ d) — (b ■ d)(a ■ c) 



Rules 5: 
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Algorithm 4. Expand a Clifford algebraic polynomial x by Rules 1, Rules 2, 
and Rules 3. 

Eval_i_wl : =proc (a;) 

stepl: expand x by bilinearity to get a sum, each term of 

the sum does not contain any addition and subtraction. 
step2: process the scalar factors in each term by Rules 2. 
step3: reorder the factors in each term by Rules 3. 

end. 

A Clifford algebraic polynomial is called expanded if it has been processed by 
Eval_i_wl. In the sequel, we will assume that the algorithm Eval_i_wl is called 
automatically when it is required. 

We now consider the forms of the terms in an expanded Clifford algebraic 
polynomial. 

Class 1 : the basic scalars a, / 3 , . . . , the basic vectors a,b,c,d, . . . , and the unit 
2 - vector I2. 

Class 2 : the elements which are obtained by taking inner or outer product with 
factors in Class 1 once. For examples, the scalars a b,c d, the vectors 
a ■ l2,b ■ I2, the 2 - vectors a Ab,c A d,. 

Class 3 : the elements which are obtained by taking inner or outer product 
with factors in Class 1 or Class 2 once. All of these elements can be 
simplified to a sum by Rules 4 and Rules 5 , the new form occurring 
in these terms is the scalar (a A b) ■ l2- 

No other new forms will be produced by this processing. So the forms of 
Clifford algebraic terms are 

vectors: a, b, c, d, ; a /2, h /2, c /2, d • /2, 

scalars: a ■ b, cd,...; {aAb)-l2, {c A d) ■ I2, ■ ■ ■ ; a, / 3 , . . . 

2 - vectors: I2] a A b, cAd, 

and an expanded Clifford algebraic polynomial is a sum of products of such 
elements and the scalars. 

From the above, there are five forms of terms in an expanded Clifford alge- 
braic polynomial 



A, A/2, A(a A b), Aa, A(a • /2), 



where 



A = ai . . . a^{ai • bi) . . . {aj ■ bj){{ci A di) ■ I2) ■ ■ ■ ((ck A dk) ■ h)- 
Rules 5 ': 

((a A b) • / 2 )((c A d) ■ I2) = (b • d){a • c) — (b • c)(a • d) 

((a A b) • / 2 )(c • I2) = (b • c)a - (a • c)b 

((a A b) • l2)c = {a ■ c){b ■ I2) - (b • c)(a • I2) 

{{a A b) ■ / 2 )(c Ad) = {b ■ c){a ■ d)l2 — (b • d){a ■ c )/2 
((a A b) • 12)12 = -(a A b) 
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With the above rules, we can simplify a Clifford algebraic polynomials to a sum 
of the following six forms 



SI 


tti . 


. ai{ai 


bi). 


•(% 


bj) 




S2 


ai . 


. ai{ai 


bi). 


•(“j 


bM< 


: A d) • I2) 


S3 


ai . 


. ai{ai 


bi). 




bj)h 




S4 


ai . 


. ai{ai 


bi). 




bj){c 


Ad) 


S5 


ai . 


. ai{ai 


bi). 




bj)c 




S6 


ai . 


. ai{ai 


bi). 




bj)(c 


■I2) 



where oi , . . .ai are basic scalars, ai, . . . aj, bi . . . bj, c, d are basic vectors. 

Definition 3. The forms SI, S2, . . S6 will be called final forms. A Clifford 
algebraic polynomial is called a final Clifford algebraic polynomial if its terms 
are all final forms. 

Theorem 1. There is an algorithm to simplify a Clifford algebraic polynomial 
in Q{l 2 ) to a final Clifford algebraic polynomial. 

Algorithm 5. Simplify an expanded Clifford algebraic polynomial x by Rules 4 
and Rules 3 to a final Clifford algebraic polynomial. 

Eval_i_w2 : =proc (.x) 

stepl: Let the terms and/or the factors with grades greater 

than 2 vanish. 

step2; Simplify the 2-vectors in x which are not I 2 by formula 
u = — {u- 12)12 > that is in the reverse order of rule (4) 
in Rules 4. 

stepS; Simplify x by other rules in Rules 4. 
step4; Simplify x by Rules 5'. x will be a final polynomial, 
end. 

3.3 Algorithm and Implementation of Prover 

We first give Maple procedures for constructions. Each construction will return 
a list that is composed of the introduced geometric objects and their Clifford 
algebraic expressions proposed by the propositions in [8]. 

Examples. Procedure on_para(A, O, E,p,f) returns 

7JF +2 

lOX,-{-+tl2)]. 

p 4p 

Procedure para_tan(r, O, _F, p, P) returns 

. OF. (OFAOP)l2 

P’ ( o 2 

p Zp^ 

Procedure inter_paraJinepassfocus(X, O, P,p, a) returns 



+ / 2 )]. 



A Clifford Algebraic Method for Geometric Reasoning 



119 



Procedure inter_paraJinepassfocus(X, O, P’jP, a, AT') returns 



[OX, OF 



2p^ 

p\a\ + OF ■ a 



[OX', OF 



2p^ 

p\a\ — OF ■ a 



Let A be a list returned by a construction, y a Clifford algebraic expression. 
The Maple procedure Eliminate(L, y) will substitute L[ 2 ] for L[l] in y and return 
the simplified result, where L[l] and L[ 2 ] are the first and the second element of 
the list L respectively. 

Let S = (ci, C2, . . . , Cfe, G) be a constructive statement, (listi, list2, . . ., list,.) 
the sequence of lists returned by the constructions (ci, C2, . . . , Cfc), y a Clifford al- 
gebraic expression about the points and vectors introduced by the constructions 
(ci, C2, . . . , Cfe). Then y can be simplified by the following algorithm. 



Algorithm 6. Simplify y hy listi, Ust 2 , ■■■, listr- 

PreProver;=proc (listi, list 2 , ..., list^, y) 
for i from r by —1 to 1 do 
y := EliminateClisti, y) ; 
simplify y by Eval_i_wl and Eval_i_w2; 
od 

RETURN (y) 
end. 



In the implementation of our program, we use a Maple procedure PreDeal 
to reduce the computing time and to shorten the proof of geometric theorems. 

Algorithm 7. Do some pretreatment of listi, Ust 2 , . . ., listr- 
PreDeal ;=proc (listi, list 2 ,. . .,listj.) 

simplify listi [2] by Eval_i_wl and Eval_i_w2; 
for k from 2 to r do 

y := PreProverdisti,. . ., listfe_i, list/; [2] ) ; 
list/; ; = [listi; [1] , y] 
od 
end. 

From the above, we have the following Prover. 

Algorithm 8. Test whether a constructive statement 

S = {ci,C 2 ,...,Ck,{E,F)) 

is true or not, and print the proof for S. Let (listi, Hst 2 , ■■■, listr) be the lists 
returned by the constructions (ci, C2, . . . , Ck)- 

Prover ;=proc (listi, list 2 ,..., list^, E,F) 

PreDeal (listi , list 2 , . . . , list,.) ; 
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for i to r do print ' ' list^ [1] =listi [2] ) ’ ’ od; 

E:= PreProver (listi, list 2 , list^, E) ; 

and print the steps; 

E:= PreProver (listi, list 2 , list^, F) ; 

and print the steps; 
if E = F then RETURN (' True ’ ) 
else 

y : =Eval_Normal — _F) ; and print ‘‘y = E — F’’ 
if y = 0 then RETURN ( ‘ True ’ ) 
else RETURN ('False’) 
fi 
fi 
end. 

In this algorithm, E = F means that E and F are identical expressions, y = 0 
means y and 0 are identical expressions too; and the procedure EvaLnormal() 
normalizes the final Clifford algebraic expression by introducing coordinates. In 
fact, Eval_normal() is never used in all examples we met, even those cannot be 
implemented by our program. 

Theorem 2. Algorithm Prover is correct. 

Proof. Obviously, before the first if sentence is called, the expressions E and F 
will only involve free variables. So the statement S is true if and only if E 
equals F, or E — F equals 0. 

If E and F are not identical expressions in form, we compute y, the normal 
form oi E — F w.r.t. the introduced coordinates. Since normal form is unique, 
E — F equals 0 if and only if y = 0. 

3.4 Some Examples 

In this subsection, we give some examples to illustrate our general method. See [7] 
for more examples. 

Let O be the origin, we denote vector OF, OA .. .hy F ,A ... . respectively 
in the proofs for the geometric statements. Our program will use the following 
procedures. 

Procedure para_const{0,F,p) returns list [p, |OE|]. 

Procedure ellLconst(0, F, a,b, c) returns [c, |OE|], [b, y/ — c^]. 

Procedure hyper _const{0 , F, a, b, c) returns [c, |OE|], [b, \J (? — of]. 

Procedure hyper _sym{ri , O, F, a, b, c, T 2 ) returns [ri, aOF + bOFl 2 ], [t 2 , aOF 
— 6 OE/ 2 ]. They are the asymptotic vectors of the hyperbola{0, F, A), and T 2 is 
optional. 

Example 1. In geometrical optics, the light beams parallel to the main axis of a 
parabola will focus when they are reflected by the parabola. 

This example can be described in the following constructive way. 
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O 




Fig. 1 



Constructions: 
point(0, F); 
para_const(0, F,p)-, 
on_para(P, O, F, p, t); 
para_tan(ei, O, F,p, P). 

Conclusion: 



OF A 6i 6i A FP 
OF - El El ■ FP 



Our program prints the following. 
PrcDeal: 



tF + 4F ■ hVT ■ F 



I 2 ■ (ei A FP)l2 

e^FP _ 

^ (/2 ■ (F A ei) + /2 • (ei A P)) I 2 

— F ■ El + El ■ P 



p = V F ■ F 




P= 

_ 4/1/ _ 

2F- hVF- F + tF 

El = — — 

2F-F 



Proof: 



I 2 • {F A Ei)l2 
F ■ El 
2 VF-FI2 
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So 



(^-2 {F ■ fY + 2F ■ PF ■ F + th ■ {F A P)Vf ■ F^ h 

t (F • + 2 /2 • (F A P)F -F-tF- PVF-F 

p 2VT -Fl2 
~ t 



(/2 ■ (ei A f ))/2 ^ (/2-(eiAPF))/2 
e\ ■ F 6i • PF 



Example H. In geometrical optics, the light beams from a focus of an ellipse pass 
through the other focus when they are reflected by the ellipse. 




Fig. 2 



This example can be described in the following constructive way. 

Constructions: 
points(0, F); 
elli_const(0, F, a, b, c); 
on_elli(F, O, F, a, b, c, ti); 
elli_tan(ei, O, F, a, b, c, F); 
onJine(Fi, O, OF, —1). 

Conclusion: 



FP A 6i Cl A PFi 
FP - El El ■ PFi 

Our program prints the following. 

PrcDcal: 



— OCOSftijT 









ei = - 



_ _ _c 

(F • F) (-(F • I 2 ) cos(ti)6 + asin(F)F) 



Fi = -F 
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Proof: 

ih ■ {ei AFP))h 

ei • ¥P_ _ 

{h • (ei A F) — I2 ■ (ei A P)) I2 

— El ■ F + Bi ■ P 

^ (-(F ■ F) cos(ti)6 + cos(ti)6(P • F) + asin(ti)((P A F) ■ I2)) h 
{F ■ F)asm{ti) + cos(ti)6((P A F) ■ I2) — asin(ti)(P • F) 

P b (cos(ti)c — cos(ti)^a — asin(ti)^) I2 

sin(ii) (ac + cos{ti)b'^ — d?- cos(^i)) 

^ 6/2 (cos(ti)c - a) 

sin(ti) (ac + cos(ii)6^ — oA cos(ti)) 

(/2-(^Aei))/2 

F\P ■ 6 i 

^ {—I2 ■ (ei A Fi) + I2 ■ (ei A P)) I2 
■ F\ — • P 

^ (/2 • (ei A F) + /2 • (ei A F)) I2 

6i • F + Cl • F 

^ ((F • F) cos(ti)6 + cos(ti)6(F • F) + asin(F)((F A F) • 12)) I2 
(F • F)asin(ii) — cos{ti)b{{P A F) • I2) + asin(ti)(F • F) 

P b (cos(F)c + cos(ti)^a + asin(ti)^) I2 

sin(ti) (ac — cos{ti)b'^ + a? cos(F)) 

^ 6/2 (cos(F)c + a) 

sin(ti) (ac — cos(ii)^^ + cos(F)) 



(/2 ■ (ei A FP))l2 {I2 ■ (FiF A ei))/2 
6 i • FF F\P ■ Bi 

2 6/2 cos(ti)a (c^ + 6^ — a^) 

sin(ti) {—a?d + cos(F)^6^ — 2 cos{tiYb‘^a? + cos(ti)^) 

Noticed that 

b = \/ — (? 

We have 

2 6/2 cos(ti)a (c^ + 6^ — of') 

sin(ti) + cos(ti)2fc^ — 2 cos(ti)262a^ + a^ cos(ti)^) 

= 0 



4 Solid Geometry 

In this section, we consider the Clifford algebra in 3-dimensional Euclidean 
space F^. Let 1^ denote the unit 3- vector, Qild) = G{E^), Giih) = and 

G2{h) = G2{E^)- 



124 Haiquan Yang et al. 



Suppose O is a point in space, u G ^2(^3), the plane with direction u and 
passing through the point O is defined as 

plane{ 0 , u) = {A G E^\OA Au = 0 }. 

the normal vector of this plane is —UI3. 



4.1 Description of the Method 

Similar to the situation in plane geometry, we consider the constructive state- 
ments about solid geometry. 

Definition 4 . A construction in is one of the following operations 

50) (point(X)) Take an arbitrary point X in the space. 

51) (on_plane(X, A, a,tiTbA2)) Take an arbitrary point X onplane{A,aAb). 

52) (interJineJine(X,A,a,B,b)) Point X is the intersection ofline{A,a) 
and line{B, b), where AB A a A b = 0 . 

53) (interJine-planefX, A, a, B,u)) Point X is the intersection ofline{A,a) 
and plane{B^ u). 

Sa) (inter_plane-plane-plane(X, A,ui, B,U2,C,U3)) Point X is the intersec- 
tion of plane{A,ui), plane{B,U2) and plane{C,U3). 

S5) (onJinefX, A,a,t)) Point X is on the line{A,a). 

In each case of the above, a point X in the construction is said to be 
introduced by that construction. 



Definition 5 . A constructive statement in solid geometry is represented by a 
list 

S ("^l 5 ^2 T • • ■ T G) , 



where 



— Si,i = 1, . . . ,k, are constructions in which the geometric objects introduced 

by each Si must be different from the geometric objects introduced by Sj,j = 
1, ... — 1, and other geometric objects occurring in Si must be introduced 

before; and 

— G = {E\,E2) where E\ and E2 are the Clifford algebraic expressions of 
some geometric objects introduced by the constructions Si and E\ = E2 is 
the conclusion of S. 

By the propositions in [8], the geometric objects introduced by the con- 
structions in E^ can be represented by a Clifford algebraic expression of known 
geometric objects. To test whether a constructive statement is true or not, we 
simply, from the last construction Sk to the first si, substitute the expressions of 
the introduced geometric objects into the conclusion’s expression, and simplify 
the result. 
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4.2 Algorithm and Its Implementation in Q(E^) 

We only consider the simplification of Clifford algebraic polynomials: Clifford 
algebraic expressions which contain no inverse of non-scalars and no denominator 
involving non-scalars. 

In solid geometry, we have to consider the commutator product on two 2- 
vectors. However, the commutator product can also be expressed with the inner 
and outer products since 



Ui X U2 = -{ui ■ I3) A (m 2 • is), Vmi, U2 G G2{E'^)- 



It is clear that Rules 1, Rules 2 and Rules 3 in plane geometry and the algo- 
rithm EvalJ_wl() are valid in solid geometry too. A Clifford algebraic polynomial 
is called expanded if it has been processed by algorithm EvalJ_wl. 

We will use the following rules and formulas. 

SRules 1: {A A B) A C = A A {B A C) = A A B A C 

Ar ■ {Bs ■ Ct) = (Ar A Bs) ■ Ct, where r -|- s < t and r, s > 0. 
SRules 2: For any vectors a,b,c G E^, 2-vectors u,m,U2,U3 G G2{E^), 

and 3- vector t: 

(1) a A {b ■ I3) = {a ■ b)l3, 

(2) a A{u- 13) = {a ■ u) ■ I3, 

(3) Ui A (U 2 ■ I 3 ) = (Ui ■ U2)l3, 

( 4 ) (a • I3) ■ I3 = -a, 

( 5 ) {u ■ I3) ■ I3 = -u, 

(6) (C/3)/3 = -t, 

(7) h-h = -1. 

SRules 3: For any vectors a,b,c,d G E^: 

a ■ {b A c) = {a ■ b)c — {a ■ c)b 
{a ■ b) A {c ■ d) = a ■ {b ■ {c A d)) 



SRules 4: For any vectors a, b, c, d,e, f G E^: 



a - {b Ac Ad) 

(a Ab) ■ {c A d A e) 
[a Ab A c) ■ [d A e A f) 



= (a ■ b)c A d — (a ■ c)b A d + (a ■ d)b A c 
= a ■ {b ■ [c A d A e)) 

= a ■ (b - {c ■ (d A e A /))) 



We consider the forms of the terms in an expanded Clifford algebraic poly- 
nomial. 

Class 1: the basic scalars a, ( 3 , , the basic vectors a,b,c,d, . . . , and the unit 
3- vector I3. 

Class 2: the elements which are obtained by taking inner or outer product of 
the elements in Class 1 once. For examples, the scalars a ■ b,c ■ d, . . . , 
the 2-vectors a A b,c A d, . . . and a ■ l3,b ■ I3, . . .. 

Class 3: the elements which are obtained by taking inner or outer product of 
the elements in Class 1 or Class 2 once. Some of these elements can be 
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simplified to a sum by SRules 1, SRules 2, SRules 3, and SRules 4, the 
new forms appearing in their terms are the scalars (a A b A c) • /a, . . 

the vectors (a A 6 ) • I 3 , . . and the 3-vectors a /\b Ac, 

Class 4: the elements which are obtained by taking inner or outer product of 
the elements in Class 1, Class 2 or Class 3 once. All of these elements 
can be simplified to a sum by SRules 1, SRules 2, SRules 3, and 
SRules 4, and no new form in their terms appears. 

Therefore no other new forms can be produced by this processing, and the 
forms of the Clifford terms are 

vectors! a, b, c, d, j (ct A b)-h, (cAd)-/3, 

sc&lcirs! a • &, c • cZ, ■ ■ • i A Z? A c) * A 3 , (dAeA/) -/a,...; a, /3, . . . 

2 - vectors: ahb, c A d, ;al 3 , b ■ I 3 , cl^, 

3- vectors: I 3 ; aAbAc, dAeAf, 

and an expanded Clifford algebraic polynomial is a sum of products of such 
elements and scalars. 

From the above, there are seven forms of the terms of an expanded Clifford 
algebraic polynomial 

A, Aa, A((aAb)-/ 3 ), A(aAb), A{a ■ I 3 ), AI 3 , A{aAbAc), 



where 



A = ai ... ai{ai • bi) . . . (a^ • bj)((ci A di A ei) ■ I 3 ) ■ . . ((ck A dk A Ck) ■ I 3 ). 



SRules 5: 



((a A b Ac) ■ l3)((d A e A f) ■ I3) = —{a A b A c) • (d A e A /) 

((a A b A c) • l3)d = {d ■ {a A b A c)) ■ I3 

((a Ab Ac) ■ /3)((d A e) • I3) = —{a Ab Ac) ■ {d Ac) 

{{a Ab Ac) ■ /3)(d A e) = ((a A b A c) ■ {d A e) ■ I3 

(A Ab Ac) ■ l3){d ■ I3) = —{a Ab Ac) ■ d 

{{a Ab Ac) ■ 13)13 = —(a Ab Ac) 

{{a Ab Ac) ■ l3){d A e A f) = {{a Ab Ac) • (d A e A f))h 



With the above rules and SRules 4, we can simplify a Clifford algebraic polyno- 
mial to a sum of the following eight forms 



SI 


ai . 


. ai{ai 


bi).. 


.{aj 


bj) 




S2 


ai . 


■ ai{ai 


bi).. 




bj){{c Ad A 


e) ■ I3) 


S3 


ai . 


■ ai{ai 


bi).. 




bj)c 




S4 


ai . 


■ ai{ai 


bi).. 




bj)((cAd) • 


h) 


S5 


ai . 


■ ai{ai 


bi).. 




bj){cAd) 




S6 


ai . 


. ai{ai 


bi).. 


■(% 


bj){c-l 3 ) 




S7 


ai . 


. ai{ai 


bi).. 


■(% 


bj)h 




S8 


ai . 


. ai{ai 


bi).. 


■(% 


bj){c A d A t 


2 ) 



where a\, . . .ai are basic scalars, a\, . . . aj, bi . . . bj, c, d, e are basic vectors. 



A Clifford Algebraic Method for Geometric Reasoning 127 

Definition 6. The forms SI, S2, . . S8 will be called final forms in Q(E^). A 
Clifford algebraic polynomial in Q{E^) is called a final Clifford algebraic polyno- 
mial if its terms are all final forms. 

Theorem 3. There is an algorithm to simplify a Clifford algebraic polynomial 
in Gils) to a final Clifford algebraic polynomial. 

Algorithm 9. Simplify an expanded Clifford algebraic polynomial x by SRules 1, 
SRules 2, . . ., SRules 5 to a final Clifford algebraic polynomial. 

Eval_i_w2 : =proc (.x) 

stepl: Let the terms or the factors with grades greater 

than 3 vanish. 

step2; Simplify the 3-vectors in x which is not I 3 hy t = — (t ■ 
Is)l3> that is in the reverse order of rule (6) in 
SRules 2. 

step3; Simplify x by other rules in SRules 2, SRules 1 and 
SRules 3. 

step4: Simplify x by SRules 5. 

steps : Simplify x by SRules 4. x will be a final polynomial, 

end. 

4.3 Algorithm and Implementation of Prover 

This is very similar to the case of plane geometry, so we omitted the details. 

4.4 Some Examples 

Example 3 (Monge). The six planes passing through the midpoint of the edges 
of a tetrahedron and perpendicular to the edges respectively opposite have a 
point in common. This point is called the Monge point of the tetrahedron. 

This example can be described in the following constructive way. 
Constructions: 

point (A, B,C,D)\ 
onJine(iI, A, AB, 1/2); 
onJine( J, A, AC, 1/2); 
onJine(iC, A, AD, 1/2); 
onJine(L, B, BC, 1/2); 
onJine(M, B, BD, 1/2); 
onJine(A, C, CD, 1/2); 

inter _plane_plane_plane(P, H, CD ■ I3, J, BD ■ I3, L, AD ■ I3). 

Conclusions: 



KP BC = 0; MB ■ AC = 0; NP ■ AB = 0. 



Let point A be the origin. Then our program prints the following. 
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PreDeal: 



H = 
J = 
K = 
TB = 



C 

2_ 

D_ 

2_ 

B 

~2 



C 

2 



MB = 
WC = 
P = 



D B 
2_ ^ 2_ 

D C 

{{B A D) ■ h)[B ■ C) - {{C A Py hKB^) - ((C A D) ■ h){B ■ D) 
2{h-(BACAD)) 

{(B A D) ■ h){C ■ D) - {(B A Cy h^B^) - {{B A C) ■ h)(C ■ D) 
2 {h ■ (B AC AD)) 



Proof: 



BC KP 



= B ■ K - C ■ K - P_+ C_P_ 

^B-K-C.K-^P^ 



= 0 



C-MP 



S 

P 



-C ■M + C_-P_ 



= 0 



B ■ NP 



CD 

2 



S 

P 



-B ■ N + ^- P_ 
-B-NP^ 



= 0 



B D 
2 



This example is Example 4.92 in [1]; the proof produced by [1] is too long to 
print. 

Example 4- Let ABCD be a tetrahedron and G a point in space. The lines 
passing through points A, B and C and parallel to line DG meet their opposite 
face at P, Q, and R respectively. Try to find the condition for Vqpqr = SVabcd- 
This example can be described in the following constructive way. 
Constructions: 

points(A, B, C, D, G); 

inter Jine_plane(P, A^ DG, D, DB A DC); 

inter Jine_plane(Q, B, DG, D, DA A DC); 
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inter Jine_plane(i?, C, DG, D, DA A DB). 

Conclusion: Compute GP A GQ A GR. 

Let point A be the origin. Then our program prints the following. 

PreDeal: 

_ (D -G) h ■ (D AC AB) 

~ -h ■ (D AC AB)-h ■ (D AC AG)+h ■ (D AC A^ + h ■ (C AG AB) 
-h-(D AC A B)D + I3-(DACA B)G - RJ 3 ■ (^ A C A G) 
h-(D AC AG) 

h-jPAC A B)D -h-jPACA B)G + Ch- {D AG AB) 
h-(P AG A B) 



Computation: 

-I 3 ■ (GP A GQ A GR)h 

= -{h-{PAGAQ)+l3-iPARAG)-l3-{RAGAQ)-l3-{PARAQ))h 
= h {h ■ (GAG AS) + 3/3 • (PACAB)) 



So Vgpqr = AVabcd if and only if the points A,B,C,G are on the same 
plane. 

This is an extended version of a problem from the 1964 International Math- 
ematical Olympiad. 
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Abstract. Glifford algebra formalism has been used recently in combi- 
nation with standard term-rewriting techniques for proving many non- 
trivial geometric theorems. The key issue in this approach consists in 
verifying whether two Clifford expressions are equal. This paper is con- 
cerned with the generalization of the work to 3D geometric problems. A 
rewriting system is proposed and its theoretical properties are investi- 
gated. Some examples and potential applications are also presented. 



1 Introduction 

Proving geometric theorems has become an important activity with respect to 
many domains such as geometry education, computer-aided design and computer 
vision. Several methods have been very successful in proving many difficult the- 
orems [3,10,12], and researchers are motivated to investigate the use of general 
techniques of automated deduction for finding machine proofs. 

Recently simple methods for proving geometric theorems have been devel- 
oped based on Clifford algebra formalism [5,7,8,11,13]. Their main purpose is 
to prove geometric theorems in a synthetic way, that is without using coor- 
dinates. Traditionally synthetic geometry is only used at an elementary level 
for performing simple computations. Analytic geometry has been much more 
developed under the influence of physicists and engineers for which geometric 
problems are reduced to algebraic ones. Actually the Clifford algebra formal- 
ism provides an elegant way for solving geometric problems in a synthetic style 
though it is possible to keep the advantages of the analytic point of view. 

The expressive power of Clifford algebra allows to reduce a lot of problems 
to the proof that a Clifford expression is equal to 0, or to the reduction of an ex- 
pression to a normal form. This makes very easy understanding the methodology 
for solving problems. However defining a normal form for Clifford expressions 
and computing such a normal form is a non-trivial task. This has already been 
achieved for the Euclidean plane [5,6] but the case of the Euclidean space ap- 
pears more complex. As many simplification rules may be applied to Clifford 
expressions, we have found convenient to use rewriting systems for simplifying 
such expressions. 

This paper investigates a rewriting system for systematically simplifying Clif- 
ford algebraic expressions associated with problems in three-dimensional (3D) 
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Euclidean space. We also study its theoretical properties (completeness, termi- 
nation) and give some insights on the normalization problem explaining why 
performing this task efficiently is so complex. Last, significant examples and 
applications of our simplification method are presented. 



2 A Synthetic Presentation 



This section provides some background on the Clifford algebra used in this paper 
and presents Clifford algebra from a synthetic point of view. 

First, we define what is a Clifford algebra. Let K be a field of characteristic 
not equal to 2, V a vector space of dimension n over the base field K and q a 
quadratic form over V. Let 



V+ = K©(0((g)V)), 

i=i j=i 

where © is the direct sum and © is the tensor product. V*" is the tensor algebra 
over V and its dimension is 2”. Let I{q) be the two-sided ideal of V"*" generated 
by the elements v © v — q{v) for every v G V. Let C be the quotient of V'*' 
by T{q). C is called the Clifford algebra associated to the quadratic form q. 

This definition of Clifford algebra leads to introducing coordinates over a 
basis of V and extending computations over V to C. This approach has been 
developed for instance in [1]. 

Instead of using it, we propose to consider an axiomatization of Clifford alge- 
bras specialized to some fixed geometry. For instance, an equational presentation 
of such an axiomatization for dimension two has been given in [.5]. The basic 
idea is to consider an internal law • representing the unique bilinear symmetric 
form associated with q. The outer product is represented by a binary associa- 
tive operator A and a unary dual operator ~ is also introduced to strengthen 
the expressive power. Last, the sum of two elements A and S of C is denoted 
hy A + B. There is also a well-defined product in Clifford algebra (the geometric 
product) but we do not use it in this paper except for multiplying elements of 
the basic field K with elements of C. In this case, it may be considered as the 
extension to C of the external law of V. 

More formally, we consider the Clifford algebra C of dimension 8 associated 
with the 3D Euclidean space £. Points in £ are represented by vectors in C. To 
every element in C is associated a grade (intuitively, its dimension!^ ) . Scalars are 
elements of grade 0 and vectors are elements of grade 1. The outer product of k 
vectors is called a k-vector and an element of C a Clifford number. A bivector 
is the outer product of two vectors, and a trivector is the outer product of three 
vectors or of a bi vector and a vector. In our case, there is no quadrivector as we 
are restricted to dimension 3. The rules for computing the grade of an arbitrary 
expression are given in the next section. In what follows, an expression in C is 
called a Clifford expression. 
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Let a be a vector. The dual of a, denoted by a"", is a bivector and may 
be geometrically interpreted as an equivalence class of parallelograms in a plane 
orthogonal to a. Reciprocally, any bivector may be interpreted as the equivalence 
class of parallelograms of given direction, orientation and area. The dual of a 
bivector is a vector. Last, any trivector is the dual of a scalar and reciprocally. 
As the space of trivectors is a line, one can take J as a basis for trivectors and 
define it such as I"" = 1; II is called the unit pseudo-scalar. 

Computation rules in C are derived from the definition of C and the op- 
erations: sum -h, inner product •, outer product A and (restricted) geometric 
product o (this symbol will often be omitted in expressions). These rules will 
be given in detail in the next section (see system R). The basic step of apply- 
ing Clifford algebra to geometric problems is to express geometric constructions 
using Clifford expressions. Such constructions are used in Section 8. 

The next section considers the problem of simplifying Clifford expressions 
represented by ground terms and provides a set of simplification rules. Although 
this set is not sufficient, its significance will appear when pseudo-coordinates are 
introduced in Section 6. 



3 Basic Simplification Rules 

Let V be a finite set of elements of V. We consider Clifford expressions as first- 
order terms built on constant symbols denoting some elements of K, constant 
symbols from V, and the pseudo-scalar II by means of function symbols with one 
argument (the dual operator) or two arguments (the sum, the geometric, inner 
and outer products). In the sequel we will need to define rewriting systems by 
means of conditional rules, and most conditions apply to the grades of the terms 
involved. We will obviously provide a way to compute these grades, but efficiency 
requires that the results of such computations be memorized, and hence to work 
not exactly with first-order terms but with labeled trees, where each vertex is 
labeled by a function symbol and a grade. However, in order to keep notations 
simple (and standard), this labeling with a grade will be left implicit. This 
structure will be called a Clifford term. From now on, t = t' means that t and t' 
are identical terms, while t =c t' means that t and t' are equal up to the axioms 
of Clifford algebra (the values [t] and |"t'] of t and t' in C are equal for every 
interpretation [a] of the elements a e V). 

Of course it is not always possible to attribute a grade to a Clifford expression, 
but we will consider ungraded expressions as expressions of grade T. It is also 
essential that the grades keep unchanged during the computation. But our aim 
is to rewrite expressions of grade possibly non-zero to 0, which is a scalar. This 
is the only case of non-preservation of the grade, and for this reason, we create a 
new grade especially for 0; this is oo. More precisely, oo is the grade of expressions 
that are trivially equal to 0 due to absorption properties of 0 and a few others 
as shown below. 

It is important to remind here the basic restriction we impose on Clifford 
expressions: the geometric product is only allowed with at least one scalar as 
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argument, that is a term of grade 0 or oo. The rules for computing the grades, 
denoted as gr(-), are the following: 

— gr(0) = oo; Va; G K*,gr(a;) = 0; Vu G V, gr(?;) = 1 and gr(J)=3. 

- If = gr(fi) and 52 = gr(t2) then: 

- = 3-5i; 



where K* is an arbitrary extension field of K and 

• + is the extension of + with x+oo = oo-i-x = oo, and x-i-_L=_L -i-x =_L 
when X ^ oo, 

• — is the extension of — with x— oo = oo— x = oo, and x— _L=_L— x = _L 
when X ^ oo, 

• |oo| = oo and |_L| =_L, 

• the order on N (the set of natural numbers) is extended with Vx G 
N, X <_L< oo. 

In the sequel we will define rewriting systems as sets of conditional rules, i.e. 
expressions of the form I r if C, where I and r are terms built with the same 
symbols as above and variables, and C is some condition on these variables. Such 
a set R defines a relation between Clifford terms defined by: t — t' if and only 
if there is a subterm s of f and a rule Z — > r if C in i? and a substitution cr 
such that (j{l) = s and cr(C) is true and t' is the term t after replacement of s 
by cr(r) (and gr((r(r)), and possibly a recomputation of some grades). A term t is 
a i?-normal form if there is no t' such that t t'. R is terminating if for any t 
there is no infinite sequence ti —^R t2 • For any binary relation B 

we note by its reflexive closure, B'^ its transitive closure and B* its reflexive 
and transitive closure. 

In order to state conditions about grades implicitly we use the following 
convention: the variables A,B,C will match with any Clifford term, the vari- 
ables Zi, k will match with terms of grade 0 or oo (i.e. h always comes with 
the condition gr(Zi) G {0,oo}), the variables X,Y,Z will match with terms of 
grade different from 0 or oo, and the variables x,y, z,u will match with terms of 
grade 1. 

We now consider the rewriting system E containing the following conditional 
rules: 





gr(fit2) = max(5i,52) (we have 51 or 52 in {0, 00}); 



gr(fi -h t2) 



min(5i, 52) if 51 = 00 or 52 = 00, 
9i if 9i = 92, 

_L otherwise; 



9 i 

_L 



OA^O, OAA^O, h-A^O 
AO^O, AaO^O, A-h^O 



0~^0, O-hO^O, 

A A i? — *■ 0 if gr(A) -|- gr(S) > 3. 
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It is rather easy to prove that grades are preserved by this rewriting system: 
Theorem 1. Ift t' then gr(t) = gr(t'). 

Proof. We prove by structural induction on t that if t — t' then gr(t) = gr(t'). 
In the base cases this is obvious since t = t' . If t is of the form (resp. ti A t2, 
t\ -t2, tit2, ti + t2), and t' ^0 then t' is of the form (resp. t'l t'l ^1^21 
+ t'2) and 3 i such that ti t[ (and tj = t' for j yf i); hence by induction 
hypothesis gr(ti) = gr(t'), and thus gr(t) = gr(t'). If = 0 then gr(t') = 00, 
and either t is 0 '^, or t is ti A ^2 or tit2 and 3 i such that ti = 0 , or t is ti + t2 
and ti =t2 = 0 , or t is ti • t2 and 3 i such that gr{ti) G { 0 , 00}, or t = ti At2 and 
gr(ti) + gr(t2) > 3 , and in all these cases we have gr(t) = 00. 

From Theorem 1 we can easily prove: 

Theorem 2. gr{t) = 00 if and only if t — 0 . 

Proof. If t — 0 then by Theorem I we have gr(t) = gr( 0 ) = 00. We prove the 
converse by induction on t. In the base case gr(t) = 00 implies f = 0 ; hence 
t ^ 1 ; 0 . If t = ti'^ and gr(f) = 00 then gr(ti) = 00; hence ti 0 by induction 
hypothesis, and thus t — 0 . If f = tit2 then 3 i such that gr(ti) = 00; hence 
ti —> 1 ; 0 and t — 0 . If t = ti + t2 then gr(ti) = gr(t2) = oo; hence t 0 . If 
t = t\-t2 then 3 i such that gr(ti) G { 0 , 00}; hence t — 0 . Finally, if t = fi A t2, 
then either 3 i such that gr{ti) = 00, and t 0 , or gr(ti) + gr(t2) > 3 and 
t — 0 . 

It is therefore clear that once the grades are computed, we do not really need 
to apply the rewriting system E. Instead, we apply the system E' defined as: 

A — !■ 0 if gr(T) = 00, A + 0 ^ A, 0 + A ^ A. 

By Theorem 2 it is clear that t t' implies t =c t' , since the same obviously 
holds of E. It is easy to prove by induction on t that if t is an if'-normal 
form, then either t' = 0 or does not contain any occurrence of 0 ; and hence no 
subterm of grade 00. Moreover, the termination of E' is obvious, and hence an E'- 
normal form is easily obtained. However, if f 0 , we may still have gr(f) =T, 
which can be the case only if t contains a subterm t\ + t2 with gr(ti) yf gr(t2). 
We consider the following system P[ of distributivity rules applied only to sums 
of grade T: 

if gr(H) y^ gr{B) then 

h{A + B) ^ hA + hB, 

X ■ {A + B) ^ X ■ A + X ■ B, 

CA{A + B)^CaA + CAB, 

{A + Br ^ A- + B-. 

Once again the termination of H is obvious (the depth of sums decreases), 
and an H-normal form is either 0 or a sum ti + • • • + of terms ti of grades 



{A + B)h hA + hB, 

{A + B) ■ X ^ A- X + B ■ X, 
{A + B) AC ^ AaC + B AC, 
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non _L (with any possible parentheses). However, H may produce subterms At2 
with gr(ti) + gr(t2) > 3 (from ti A (^2 + H) with gr(t2) ^ gr(i3)); hence it is 
necessary to recompute the grades and apply the system E' once more. After 
these steps, we therefore obtain an H and E' normal form, that is either 0 or a 
sum of terms t^’s whose subterms are non-zero and of grade between 0 and 3. 

In the sequel, the rules of E' will be used again, since some O’s may be 
produced, while the rules from H will not, since only sums of definite grades 
may be produced by the forthcoming rules. 

4 Reduction to Normal Form 

The exact structure of the terms obtained after applying the basic simplification 
rules can be made explicit by looking at the expressions for computing the grades, 
from which we can read the following grammar for terms of grades 0 to 3: 

Lq ::= IK*|Lo + Lo\LoLo\Li ■ Li\L2 ■ T2IA3 • T3IT0 A LqIL^'^ , 

L\ ::= Y\L\ + L\\LqLi\LiLq\Li ■ T2IA2 • T3IA3 • T2IA2 ■ Ti|ToATi|Ti AL0IL2 , 
L2 '■'■= L2 + L2I A0T2I A2T0IA1 • T3IT3 • Li\Lq a L2IA1 A Li\L2 a Lq\Li^, 

L3 ::= I\L'i -f L3IL0L3IL3TQIL0 A L^\L\ A T2IA2 A T1IT3 A TqILq"^, 

where Li is the language of terms of grade i. Hence the structure of these terms 
is still quite complex, and they can certainly be reduced by using computation 
rules (equations) from Clifford algebra. For instance, computations in K can be 
performed by standard means of computer algebra. Similarly, we can consider 
terms of grade 0, such as (1 -f- a • a)a • b + (— l)b • a, as polynomials over indeter- 
minates a • a, a • b, b • a, and perform computations using a computer algebra 
system. But this is clearly not sufficient, and we need Clifford algebra to tell us 
that b • a = a • b, and to reduce complex products. 

But we must first make clear what we shall consider as irreducible terms, i.e. 
sublanguages of Li of terms which cannot be reduced further simply by applying 
rules of Clifford algebra. More precisely, we aim at defining a syntactic notion of 
irreducibility, one that does not depend on particular relations which may exist 
among the basic vectors (elements of V) appearing in a Clifford term; these 
relations will then be easier to cope with. 

It is quite obvious that the expressions a,a~,a-b,aAb, J cannot be reduced 
further, except by eventually swapping arguments. This is also the case of a • b'^ 
and a • (b • c"^). This gives a (tentative) grammar for irreducible terms: 

LI ::= K*|V • V|V • (V • N~)\Ll + Ll\LlLl, 

L\ ■.■.= \\y ■y~\Ll + Ll\LlLl, 

L° ■■=Y-\YAY\L° + L°\L°L°, 

L°-,.= E\L° + L°\L°Ll 

and it is difficult to see how it could be made simpler (except if an expression 
is null, such as a A b -|- b A a, but it is convenient to keep a simple syntactic 
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definition of irreducibility) . We are going to prove that these irreducible terms 
can always be reached as normal forms of a rewriting system R. The rules of R 
are divided into the following 5 groups. 

(1) . Distributivity rules: 

h{X + Y)^hX + hY, 

Z ■ {X + Y) ^ Z ■ X + Z - Y, 

Z A{X + Y) ^ Z AX + Z AY, 

(2) . Linearity rules: 

{hX) - Y ^ h{X -Y), 

{hX) AY ^ h{X AY), 
h A A ^ hA, 

Xh hX, 

{hX)~ hX~. 

(3) . Duality rules: 

-hi, I~ ^1, X ■ I ^ -X~, {X~)'^ -X, 

where —t stands for (— l)f. 

(4) . Ordering rules: 

X AY ^ AX AY A X, 

X -Y ^ (_l)|a-h|-min(a.6)y .X AY A X, 

where a = gr(X) and b = gr(T), and ^ is an order on the set of ground 
terms. 

The following definition of A is convenient. We assume an order < on V, 
and also that for any constants c, c' such that gr(c) < gr(c') we have c < d . 
This order is extended to the set of function symbols in the following way: 
•<A<~<o<+. This symbol precedence is used to define an order on ground 
terms. The definition is given by structural induction and extends the order < 
on constants. Let t and t' be two terms. Then t A t' A and only if one of the 
following conditions is satisfied: 

— gr(t) < gr(t'); 

~ gr(t) = gr(f'), o ^ {hd(t),hd(t')} and hd(t) < hd(t'), where hd(-) denotes 
the head symbol (i.e. the operator); 

— gr(t) = gr(t') and t = hA and t' = kB, where either A A B or A = B and 
h A k] 

— gr(t) = gr(t') and t = A and t' = kB, where either A A B or A = B; 

— gr(t) = gr(t') and t = hA and t' = B, where A A B] 

— gr\t) = gr(t') and hd(t) = hd(t') = / yf o, where t = f{ti, . . . ,tm) and 

t' = f{t[, . . . , t'^), and there is some i S {1, . . . , to} such that ti A t ■ and for 

every} G tj = t' . 



{X + Y)~ ^ X- + Y-, 

{x + Y) ■ Z ^ X ■ Z + Y ■ Z, 
{X + Y) A Z ^ X A Z + Y A Z. 



X ■ {hY) h{X ■ Y), 
X A{hY) h{X AY), 
X Ah ^hX, 
h{kX) — > {hk)X, 
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In these rules, the scalar (—1)^ has to be computed each time a rule is 
applied. Alternatively, we can consider these rules as a shortcut for the set of 
rules obtained from these by fixing the values of a and b (in {1,2,3}). The 
reason for these two ordering rules is to reduce the number of rules in R. Remark 
that ^ allows to perform fast comparisons between terms. 

(5). Reduction rules: 

(5.1) . (xAy)^^x- 

(5.2) . {x ■ y'^y ^ -X Ay, 

(5.3) . xA{yz~)^{x- y)z~ - {x ■ z)y~ , 

(5.4) . X Ay'" ^ —(x ■ y)I, 

(5.5) . X A {y A z) ^ X ■ {y ■ z~)I, 

(5.6) . X ■ {y A z) ^ {x ■ y)z - {x ■ z)y, 

(5.7) . {x ■ y~) ■ z~ ^ {x- z)y - {y ■ z)x, 

(5.8) . {x ■ y~) ■ {z ■ u~) {x ■ z){y ■ u) - {x ■ u){y ■ z), 

(5.9) . x~ ■ y~ ^ -X ■ y, 

(5.10) . {x Ay) ■ z'" ^ X ■ {y ■ z'"), 

(5.11) . {x A y) ■ {z A u) ^ {x ■ u){y ■ z) — {x ■ z){y ■ u). 

We can now prove that the i?-normal forms are exactly the irreducible terms. 
Here, we consider that the order < on V is 0, which results in the smallest 
possible 

Theorem 3. Vt € Li, t is a R-normal form if and only if t € L°. 

Proof. The “if” part is easy: we prove by induction on t G L° that t is a, R- 
normal form. The base cases are a • b and a • (b • c"^) for i = 0, a and a • for 
i = 1, a"^ and a A b for i = 2, and II for z = 3, and these are obviously i?-normal 
forms. If f = + t 2 C L°, then ti,t 2 G L°-, hence by induction hypothesis they 

are i?-normal forms, and so is t. li t = t\t 2 G L°, then ti G Lq and t 2 G L° 
are i?-normal forms, and so is t. 

We now prove by induction on t G Li that if t is a i?-normal form then 
t G L°. The base cases are t gK.* for z = 0, t € V for z = 1, and t = J for z = 3, 
which are i?-normal forms and are in L°. We now consider t G Li a i?-normal 
form; then every strict subterm t' of t is a i?-normal form as well, and hence 
t' G by induction hypothesis. If t = fi + 12 then ti,t 2 G L°; hence t G L°. 

II t = tit 2 , then according to a linearity rule t\ G Tg and t 2 G L°; hence t G L°. 
We now consider the case where t is not a sum or a geometric product: then t 
does not contain any sum or geometric product; otherwise some distributivity 
or scalar rule could be applied to t. Hence any strict subterm t' (with grade j) 
of t, being in L°, is of the form a • b or a • (b • c"^) if j = 0, a or a • b'^ if j = 1, 
a'^ or a A b if z = 2, and II if j = 3. We consider all possible cases: 

If t = tx" with t\ G L°, then j yf 0 and j 3 by duality rules. If j = 1 
(z = 2) then ti cannot be of the form a • b"^ because of rule (5.2); hence ti is 
of the form a, and f € T°. If } = 2 (z = 1) then ti cannot be of the form a A b 
because of rule (5.1), and t\ yf a~ because of duality; hence j = 2 is impossible. 
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If t = ti A t 2 with ti € L°,t 2 G then j 0,/c 0 due to scalar rules, 

j < k due to ordering rules, and of course j + fc < 3. We have t G L° when 
j = k = 1 {i = 2) and t\,t 2 G V. In all other cases t ^ L°, and we prove that t 
is not a i?-normal form. In case j = fc = 1, if ^2 = a • then rule (5.3) applies, 
and if = a • and t 2 G V then an ordering rule applies. If j = 1, fc = 2 
{i = 3), then t 2 ^ a~ because of rule (5.4) and t 2 7 ^ a A b because of rule (5.5). 

If t = fi • f 2 with ti G L°i't 2 G then j y^ 0,fc yf 0 since t £ Li and 
j < fc by ordering. If fc = 3 then t 2 = L, which is impossible by duality rules. 
Hence we have 3 possible cases for j and fc. In case j = fc = 1 (i = 0), if ti G V 
we have t G L°; otherwise ti = a • b'^, and ^2 G V is impossible by ordering, 
while ^2 G V • V"^ is impossible due to rule (5.8). If j = 1, fc = 2 (i = 1), then 
t G L° when t\ G Y,t 2 G V'", and the other cases are impossible: t 2 = b A c 
because of rule (5.6), and ti = a ■ b'",t 2 = c"" because of rule (5.7). The last 
case j = fc = 2 is impossible: when ti = a'^, ^2 = b A c is impossible by ordering, 
and t 2 = b"" because of rule (5.9), and when ti = a A b, t 2 = c~ is impossible 
because of rule (5.10), and t 2 = c A d because of rule (5.11). 



It is clear that the above result can be generalized to any order <, by restrict- 
ing the L°’s to the smallest elements w.r.t. the corresponding It is however 
not necessary to give further details of this, though in the sequel we consider 
only linear orders < on V, so that ^ is also a linear order. 

The previous theorem does not assert that the system is terminating but 
only gives syntactic criteria for irreducible terms. Termination is the focus of the 
next theorem. 



Theorem 4. RU E' is a ground terminating rewriting system. 



Proof. We proceed in four steps. First we prove that the system (1) -|- (2) is 
terminating. Then we prove that (3) -I- (5) also has this property. In the third 
step we prove that (1) -I- (2) -|- (3) -I- (5) is terminating by introducing a function 
from the algebra of terms to N. We will conclude by considering the applicability 
of rules in the group (4) together with the system E' and the computation of 
grades. 

Let < be an order on function symbols: -|-<o<-<A<~. Let <m be 
the multiset path ordering extension of < (see [4]); then is a simplification 
ordering and for every rule Z — > r in (1) or (2) it is clear that r <m 1. This proves 
the termination of (1) -I- (2). 

We note by t[p ^ u] the result of the substitution by the term u at position p 
in the term t. Let p be the following map, defined by structural induction, from 
the algebra of terms to (N, <); when a; is a functional symbol of arity n, p{x) is 
the map from N" to N associated to x: 



Clifford Term Rewriting for Geometric Reasoning in 3D 



139 



y.{r^){x) = 4a;, 

K-){x,y) = 2xy, 

K°){x,y) = xy, 

K+){x,y) =x + y, 
fi{F) = 3, 

/i(c) = 1 for every constant c ^ I. 



It is easy to check that for every rule Z ^ r in (3) + (5) we have y,{r) < fi{l). 
Moreover, if t is a ground term, p a position in t, u and v two terms such that 
pi{v) < y,{u) then /i(t[p ^ tt]) < p{t[p ^ f]), i.e. /i is strictly monotone. As 
(N, <) is well-founded, (3) -I- (5) is ground terminating. 

The map p also has the following property: for every rule Z — > r in (1) -|- (2) 
/i(Z) = p{r). Let us assume that (1) -I- (2) -|- (3) -I- (5) is not terminating. As 
(1) -|- (2) is, there would be an infinite sequence of terms with strictly decreasing 
measure /i, which is impossible. 

A similar argument may be used for proving the termination of R. Let (ti) be 
an infinite sequence of terms successively rewritten by rules in R. As for every 
rule / — > r in (4) /a(Z) = p{r), there exists an i such that for every natural j > i, tj 
is irreducible by any rule in (3) -I- (5). We can assume without restriction that 
j = 0. Let us prove by induction on n the number of occurrences of -I- in to that 
such an infinite sequence cannot exist. 

If n = 0, only the rules in (2) and (4) may be used. Let ly be the map defined 
as follows: 



u{x A y) 

v{x ■ y) 
v{x~) 
u{xy) 
u{x + y) 

u{c) 



i2{v{x) + v{y)) iix<y, 
2(j^(a;) -I- ^(y)) -I- 1 otherwise, 
v{x^y), 
v{x), 

v{x) + v{y), 
v{x) + iy{y), 

( 0 for every constant c G K, 

[ 1 otherwise. 



For every rule Z — > r in (2), i/{r) < iy(l). But for every rule Z — > r in (4), 
v{r) < v{l). As V is strictly monotone and (N, <) well-founded, there exists an i 
such that the only rules applied to the tj (j > i) are those in (2) and as (2) is 
terminating, iti) cannot be infinite. 

If n > 0, it is easy to see that there exists an i such that the root symbol of ti 
is -I-. As the number of occurrences of the symbol -I- in a term cannot increase, 
this means that there exist u and v such that ti = u + v and the number of 
symbol -I- in u and v is less than n. Moreover, no rule can be applied to the root 
of ti. By induction hypothesis, the sequence is finite. 

We see that for every rule Z — > r in A' we have p{r) < So even when 
the rules in the system E' are taken into account, it is necessarily terminating. 
Note also that according to the previous argument, we can add any rewrite rules 
Z — > r with fj,(r) < pil) while preserving the termination property. 
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5 Further Simplifications 

We have therefore proved that an irreducible form of a Clifford expression can 
indeed be reached in finite time. However, we have seen that the rewriting sys- 
tem R may not be able to rewrite a term to 0, even if t =c 0; this is due to the 
fact that what we have called irreducibility is only a syntactic property, as is 
clear from the fact that all rules in R are left-linear (there is at most one occur- 
rence of variables in their left-hand side), and hence that basic relations among 
constants are not exploited. We do this by means of the following geometric 
rules: 



a; A a; ^ 0, a; • a:"^ — > 0, a: • (a; • — > 0. 

Clearly these rules are still not enough to rewrite a — a to 0; but for this only 
standard algebraic computations are required, and it is convenient to sum up 
the corresponding rules into a single one: 

X Y if X computes to Y and Y ^ X. 

This computation is to be performed by some computer algebra system, by 
considering the subterms of X which are not elements of K as indeterminates. 
Of course, we have to trust the system that n{Y) < n{X), but this is a very 
reasonable assumption. 

In order to minimize the number of indeterminates, we have to care for 
equalities such as a • b'^ = — b • a'^. In order to avoid non-termination we use 
the order ^ previously defined. Then 

a; • > — y ■ x"' if y ^ x, x ■ {y ■ z'^) —y ■ {x ■ z^) if y < x. 

Note that the measure fi is unchanged by application of these basic ordering 
rules, and hence the argument used in Theorem 4 can be applied here. 

The set of these rules (geometric, algebraic computations, basic orderings) 
will be noted Z . Applying the rewriting system Z (obviously terminating) to a 
term may replace subterms by 0; hence it is necessary to include the elimination 
rules from E' (and to recompute the grades each time a 0 is produced). Hence 
the rewriting system to be applied to a term in Li in order to reduce it to 0 is 
E' U RU Z, which is terminating as shown by the above remark. 

The rules from R are all needed but sufficient for obtaining the reduced form 
defined above. Unfortunately, the system E'URUZ is not complete with respect 
to the reduction to 0, due to the fact that R is not confluent. For instance, take 
t = ((a • b'^) • c'^) • d'^, which is not a normal form: rule (5.7) applies, but in two 
different ways, yielding either 

ti = ((a • b'^) • d)c — (c • d)(a • b'^) or t 2 = {a ■ c)(b • d"^) — (b • c)(a • d'^), 

which are both i?-normal forms. Hence t — ti or t — t 2 (U, ^2 form a critical 
pair of R) according to the strategy used to apply the rules; hence either t—t 2 — 



Clifford Term Rewriting for Geometric Reasoning in 3D 



141 



ti — t 2 or t — ti t 2 —ti, and ti — t 2 is a, E' U RU Z-normal form, despite the 
fact that ti — t 2 =c 0- The system cannot be complete if it is not confluent. 

Of course, the standard way to make a rewriting system confluent is to add 
rules derived from the critical pairs, either ti t 2 or t 2 t\, or some other 
rules able to rewrite ti and t 2 to a common term. The many critical pairs of R 
can actually be solved by the following three rules: 

(a) , (a; • y){z ■ {u ■ v~)) - (x ■ z)(y ■ (u ■ v'^)) 

(x ■ u)(v ■ (z ■ y'")) - (x ■ v)(u ■ (z ■ y'")), 

(b) . {x- {y z'"))u {x ■ u){y ■ z'") + {y ■ u){z ■ x~) + (z ■ u)(x ■ y~), 

(c) . {x ■ {y ■ z'^))u~ — > {x ■ u){z f\y) + {y ■ u){x f\ z) + {z ■ u){y A x). 

However, these rules are non-terminating, and hence they do not guarantee con- 
fluence. From a practical point of view, rule (b) may be difficult to control (or we 
may restrict it to the case where u is not of the form t ■ and rule (a) needs 
a special order to ensure termination. Moreover, they are here written up to the 
associative-commutative properties of the geometric product and the sum, and 
rule (a) is not left linear, and this implies complex and expensive matchings. 
They are not right-linear either, and hence also imply term duplications. For all 
these reasons we postpone the study and use of these rules. 

It should be noted that confluence does not imply completeness, and does 
not solve either the problem of the strategy for applying the rules. Even if all 
sequences of rewriting steps converge to the same result, some may converge 
much faster than others. The complexity of the application of the rules (higher 
if non-linear) should also be addressed. 

Since some rules are not right-linear (the distributivity rules and rules (5.3), 
(5.6)-(5.8) and (5.11)), we should be careful to apply such rules so as to avoid 
the duplication of unnormalized terms, and hence of rewriting steps. This implies 
an innermost strategy: if two rules can be applied, one to t and the other to t' 
and t' is a subterm of t, then the rule on F is applied first. We also give priority 
to linear over non-linear rules. 

We only have three rules which are not left-linear: the geometric rules. The 
matching of the left-hand side of these rules therefore involves syntactic equality 
testing between terms. However, in the context of the innermost strategy, we 
can restrict the application of these rules to the case where the variables are 
matched with elements of V, so that testing the equality becomes trivial. It is 
indeed clear that with this strategy, the geometric rules can only be applied to 
terms t At, t-t'^ and t ■ {t - where t and t' are i?-normal forms. By noting Z' 
the system Z with the above-mentioned restriction on the geometric rules, and 
R' = E' U RU Z' , we have: 

Theorem 5. If t, t' G L° then 

t A t 0, 
t ■ t- 0, 
t-{t- tn 0. 

This remains true with an innermost strategy. 
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Proof. We prove the first two, by induction on t. If t G V, the result is trivial 
by applying a restricted geometric rule. If t G V • V"", say t = a • b~, then t At 
rewrites to ((a • b"") • a)b'" — ((a • b"") • b)a'" by rule (5.3), and thus to 0 by 
restricted geometric rules (and algebraic computations). Similarly, rewrites 
to (a • b'^) • (—a • b) by rule (5.2), then to —[((a • b'^) • a)b — ((a • b"^) • b)a] 
by rule (5.6) (and linearity), and finally to 0 by geometric rules and algebraic 
computations. 

If t = + f 2 , then t At rewrites to A + ti A ^2 + ^2 A ti + t 2 A ^2 by 

distributivity, and to 0 by induction hypothesis, anticommutativity of A and 
algebraic computations, and similarly for t ■ . If t = t^ti with fp G Lq, then 

similarly t At and t ■ t~ rewrite to 0 by linearity rules and induction hypothesis. 

We prove the last line by induction on t, then on t' . If t G V, say t = a, 
we prove that Vt',f • ft ■ t'^) — 0. For t' G Y this is trivial. If = a • b"^, 
then t ■ ft ■ t'^) rewrites to —a • (a • b A c) by rule (5.2) and linearity, then to 
— [(a-b)(a-c) — (a-c)(a-b)] by rule (5.6), linearity and distributivity, and finally 
to 0 by algebraic computation. The inductive cases t' = tf + and t' = tft'^ are 
proved as above. The case f = a • b"" is similar: t' G V is solved by rule (5.7), 
and t' gY ■ Y'" by rules (5.2) and (5.6). The inductive cases for t' are solved as 
usual. The induction on t can then proceed: if t = fi + ^ 2 , the base case t' G Y 
is trivial, and f' = a • b"" is solved by rules (5.2) and (5.6), and the induction 
on t' is as above. If t = toti, then the base case t' G V is trivial, and the case 
t' = a • b"^ is solved by rules (5.2) and (5.6), and the induction on t' is as above. 
All these rewriting steps follow an innermost strategy. 

It is therefore clear that the rewriting system R' is complete over E' U RU Z 
for rewriting terms to 0, as far as an innermost strategy is used for applying the 
rules. Although it is not complete over equality in Clifford algebra, it provides 
simple normal forms for Clifford terms. Intuitively, the incompleteness of this 
rewriting system is due to the lack of a general mechanism to use the linear 
dependence of any four vectors. 



6 Pseudo-coordinates 

A possibility for computing normal forms of expressions in Clifford algebra and 
proving equality consists in using both the rewriting system presented in the 
previous sections and pseudo-coordinates. Although it is not possible to get a 
complete rewriting system, it is possible to simplify a null expression to 0 by using 
the previous rules on transformed expressions. The transformation consists in 
replacing any vector by a linear combination of three independent vectors. In 
some sense, this transformation is equivalent to the introduction of coordinates 
for any vector. That is why it is called pseudo-coordinates expansion. 

Pseudo-coordinates are a practical means of using coordinates in our syn- 
thetic framework. They also ensure completeness of our approach. The main 
idea is the following: let a, b be two non-zero non-collinear vectors, with a -< b. 
Then a, a • b'^ are two orthogonal vectors and a, a • b'^, (a A (a • b'^))'^ form 
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an orthogonal basis. Here a and b are called the reference vectors. It follows 
that any vector c may be expressed as a linear combination of the three latter 
vectors with Clifford coefficients. If c =c xa + y{a ■ b"^) + z{a A (a • b'^))'^, then 
c • a =c x(a ■ a); hence 

c • a 



and similarly: 

c • (a • b~) c • (a A (a • b'"))'" 

^ ^ (a • b"") • (a • b~) ’ ^ ^ (a A (a • b~))~ • (a A (a • b~))~ 

Using the reduction rules given in the previous section, one gets 

(a A (a • b'^))'^ =c (a • b)a — (a • a)b. 



Let 



K = (a ■ a)(b • b) — (a • b)^; 



we have 



K=c (a A (a • b~))'" • (a A (a • b'"))'" =c (a • b"") • (a • b""), 

and hence [k] = 0 if and only if [a • b"^] = 0 if and only if [a] and [b] are 
collinear. By replacing these expressions in x, y, z and factorizing a, b and a-b"^, 
we get the pseudo-coordinate expression of c relative to a and b: 

c =c pce(c, a, b) = aa + /?b + y(a • b"") 



with 

(c • a)(b • b) — (c • b)(a • b) 

5 

K 

(c • b)(a • a) — (c • a) (a • b) 

5 

K 

c • (a • b'^) 

K 

Pseudo-coordinates may be introduced in a Clifford term t by replacing c with 
pce(c,a, b). After all vectors in V (except a and b) are expanded this way, we 
obtain an expression g =c t containing fractions: g is the pseudo-coordinate 
expansion of t with respect to the reference vectors a and b. In order to eliminate 
the denominators, we first multiply the whole expression by a suitable power of k, 
and compute the normal form of the resulting term; we obtain the reduced term e 
associated to the expression g. Hence there is an n such that K^t =c K^g =c £• 
It is clear that Vc € V \ {a, b}, Kpce(c, a, b) is a linear combination of a, b and 
a • b'^, where the coefficients are polynomials over IK in the indeterminates a • a, 
a • b, b • b, c • a, c • b and c • (a • b'^). 



a = 

/9 = 

7 = 
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Theorem 6. e is a linear combination o/l,a,b,a-b'^,a'^,b'^,aAb and II with 
coefficients being polynomials over K in the indeterminates a • a, a • b, b • b, and 
all c- a (or a - c if a ^ c), c-b (or h-c), c-(a - b"^ ) (or a - {c-h'^) or a - (h-c'^)) 
for every c gY \ {a, b}. 

Proof. By induction on t: In the base cases t t = a, t = b, t e K, this is 
obvious. If t = c, this is true of Kpce(c, a, b), and hence of e which is identical 
to Kpce(c,a, b) up to basic orderings. Suppose now that it is true of ti and t2] 
then it is obviously true of t\ + 12 and tit2 (if gr(ti) G {0, oo}). To see that it is 
also true of ti~,ti ' ^2 and ti A t2, it is sufficient to check that it is true of the 
normal forms of: 

— the duals of 1, a, b, a • b"^, a"^, b'^, a A b and I, 

— the inner and outer products of any two terms in {l,a, b,a • b'^,a'^,b'^, 
a A b, J}, 

and to apply linearity and distributivity rules. 

The following theorem is crucial in our soundness proof. 

Theorem 7 (Extension of generic Clifford algebraic 3D identities). Let h be a 
Clifford term of grade 0 and t be any Clifford term. If h 0 O'^d for every value 
\h'] yf 0 => [t] = 0 then t =c 0. 

Proof. It is clear that ht =c 0. Let ei, e2, 03 be three constants not in V denoting 
vectors. Let t' be the term t in which all c G V are replaced by X^ei + X|e2 + 
Affea where Xf are constants of grade 0. We note X the set of these constants. 
By applying the same transformation to h we get the term h' . 

We consider an orthonormal basis of V as a fixed interpretation of (ei , 02, 63), 
such that [63]"" = [ei] A [02]. Every interpretation of X can be extended 
this way: the result is an orthonormal interpretation of X. We note the 
equality modulo orthonormal interpretations. It is clear that there is a bijective 
correspondence between V-interpretations and orthonormal X-interpretations 
compatible with the replacement above: hence t =c 0 if and only if t' =(i 0, and 
ht =c 0 if and only if h't' =(i 0. It is easy to check that: 

ei • ei =2 02 • 02 =c 63 • 03 =2 1, 

Oi • O2 =c 62 • 03 =2 Oi • 03 =2 0, 

0iA02=c03~, 02 a 03=^01'", Oi A O3 -02'", 

ei-®2"=c“®3 , e2 ■ =(i -ei, ei ■ =(l 62. 

Following the previous theorem, and using the rewriting system R' as well as the 
equations above, h't' can be reduced to a linear combination of 1, Oi, 02, 03, Oi'^, 
62"^, 63"^ and I, with coefficients being polynomials in IK[X]. Since h't' =(i 0, its 
scalar, vector, bivector and trivector parts are null. The scalar (resp. trivector) 
part is the polynomial coefficient of 1 (resp. F); hence this polynomial is null. 
The vector part is a linear combination of ei , 02 , 63 , and these three vectors are 
independent; hence their polynomial coefficients are null. The bivector part is a 
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linear combination of ei"^, 62'^, 63'^, and these three bivectors are independent; 
hence their polynomial coefficients are null. But these coefficients are the product 
by h' of the corresponding coefficients of t' , which are therefore null since h' 0. 
Hence t' 0, and t =c 0. 

Completeness and soundness are direct consequences of the two previous 
theorems: 

Theorem 8. t =c 0 if and only if e = 0. 

Proof. If t =c 0, then =c £ =c 0; since e is a linear combination of 
l,a, b,a • b'^ (which are independent vectors), a'^,b'^,a A b (which are inde- 
pendent bivectors) and I, as in the previous proof their polynomial coefficients 
are null, and can be rewritten to 0 by standard algebraic computations. Since e 
is in normal form, we have e = 0. 

If e = 0, K^t =c 0 and as k" 0, it follows from the previous theorem that 
t=c 0. 

Pseudo-coordinate expansion can be applied to any Clifford expressions, but 
it is important to compute the normal form of t first, because even if it does not 
yield 0, it may considerably reduce t and hence ease the process of expansion. 

Remark that applying pseudo-coordinate expansion only involves basic prop- 
erties of the equality and rules from R', which proves that the equational theory 
obtained from R' is complete for =c- 



Experimental Considerations 

The time necessary for simplifying an expression according to the previous 
method highly depends on the choice of the basic vectors. In the worst case, 
this is equivalent to the introduction of scalar coordinates. However it appears 
that in many cases a good choice of the reference vectors reduces significantly 
the number of simplification steps to be performed. The whole method and a 
good strategy for introducing pseudo-coordinates are presented below. 

The main problem in pseudo-coordinate expansion consists in selecting the 
two reference vectors. In geometric constructions, some vectors are taken to be 
free. As we deal with space problems, there are usually two or more free vectors. 
It is thus possible to consider two of them as the reference vectors. Following 
this strategy, any vector c different from the reference vectors, say a and b, 
is replaced by its expansion pce(c, a, b) using pseudo-coordinates. We call this 
global expansion. 

However a more efficient strategy may be used. Instead of choosing the ref- 
erence vectors once for all, one may decide to choose them according to the 
neighborhood of the constant in a term. For instance, let us consider the follow- 
ing (sub-) expression 



(c-d)(c-e). 
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If we decide to expand it using a, b as the reference vectors, one gets a much 
larger expression than using d, e. In the latter case, the expression is already ex- 
panded. The key idea consists in minimizing the number of monomials generated 
by expansion. To achieve this goal, we propose the following two heuristics. 

— Order-based local expansion. First, the set of constant vectors is ordered. 
As innermost terms are reduced first, we compute the two maximal vectors 
according to this order in each subterm and take them as the reference 
vectors for the expansion in this subterm. 

— Count-based local expansion. Occurrences of vectors in a term are counted, 
and the two vectors with the two largest numbers of occurrences are taken 
as the reference vectors. 

As pseudo-coordinate expansion introduces a sum, this may lead to a combi- 
natorial explosion of the number of terms. That is why in practice this expansion 
should be applied only after a first normalization. It appears that the proof of 
many geometric theorems does not require pseudo-coordinates (see Example 1 
in Section 8). 

Of course this depends on the formulation of the theorems. In some sense 
we can assert that any constructive geometric theorem may be formulated in 
a way such that it does not require to introduce pseudo-coordinates: simply 
imagine that constructive expressions are built directly with pseudo-coordinates. 
However, this would hardly qualify as a coordinate-free method! In practice, we 
want to formulate theorems in the simplest possible way, and postpone the use 
of pseudo-coordinates as far as possible. 

Another important point concerns the use of computer algebra algorithms for 
simplifying scalar expressions. This point is crucial for efficiency and the reader 
may refer to [6] for further details. Last we stress the fact that the considera- 
tions on the priority of rules (presented in the previous sections) are of prime 
importance for the efficiency of rewriting. 



7 Geometric Reasoning 



The rewriting techniques and system for proving Clifford identities presented 
in the previous sections have an interesting application to geometric reasoning. 
This application is based on the observation that many geometric concepts and 
relations can be represented by means of Clifford algebraic expressions. There- 
fore, reasoning about geometric problems may be reduced to manipulating such 
expressions and to investigating their relationships. 

Consider for example the case of proving geometric theorems. From [.5,7,11], 
one sees that many theorems in plane and solid Euclidean geometry can be 
formulated by using Clifford algebra, where the hypothesis-relations of a theorem 
in question are expressed as a system of Clifford algebraic equations. By means 
of vectorial equations solving as presented in [8], the hypothesis-relations may 
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often be transformed into the following triangular form 

= 0 , 

/i2(M,Xi,X2) = 0, 

hr{fJ;Xl:X 2 ,--- , Xr) = 0, 

where each hi is composed using Clifford operators, /jl = (fii , . . . , Hd) are free 
geometric entities (called parameters), and Xij--- iXr are constrained by the 
geometric hypotheses (called dependents). Examples of geometric entities are 
points, areas of triangles and other geometric invariants. 

To make the triangiilarization possible and easy, it is usually required that 
the theorem under consideration is stated constructively. To reduce the trian- 
gularizing cost, the last two authors [ 6 , 11 ] have considered a restricted class of 
geometric theorems for which each dependent Xi can be solved in terms of pi 
and Xii ■ ■ • iXi-i using Clifford operators; namely, 

Xi = /*(m,Xi,--- ,Xi-i): l<i<r. (H*) 

A number of popular constructions with the corresponding expressions of fi in 
plane Euclidean geometry are provided in [6,11]. In fact, by solving vectorial 
equations as in [7] similar constructions and their corresponding Clifford rep- 
resentations may be established for 3D Euclidean geometry, where the vector 
space V is of dimension 3. Here are two examples. 

— Construct the vertex X such that P 1 P 2 P 3 X is a parallelogram: 



A = parg(Pi, P2, ^3) = P1+P3- P2- 



This representation holds for plane Euclidean geometry as well. 

— Construct the intersection point X of line P 1 P 2 and plane T 3 T 4 T 5 : 

A = \ntp{P^P 2 ,P 3 ,P 4 ,P 5 ) = APi + (1 - X)P2, 



where 



[P3 A P4 A P5 - J^2 A (P4 AP5 + P3XP4- P3A T’s)]"-' 

[{Pi - P2) X {Pa A P5 + P3 X P4 - P3 X P3)]- 

The non-degeneracy condition is: the denominator is non-zero, i.e., P 1 P 2 [f 
P3PaP5- 



Such explicit Clifford representations may help make the proving method 
more effective. In deriving them, some (non-automated yet elementary) geomet- 
ric reasoning is often needed. We do not get into the details in this paper. 

Meanwhile, constructive geometric theorems of this type have also be studied 
by Yang, Zhang and Feng [13] , using Clifford algebra formalism and similar prov- 
ing techniques. A set of constructions with geometric objects including points. 
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lines, circles, and ellipses in both 2D and 3D with the corresponding Clifford 
expressions have been given by them. 

Suppose that the conclusion of the theorem to be proved is given as another 
Clifford expression 

5 = 5(m,Xi,--- ,Xr) = 0. (C) 

Let di be the denominator of fi in (H*) for 1 < z < r. Proving the theorem 
amounts to verifying whether (C) follows from (H*), under the subsidiary (non- 
degeneracy) conditions fy 0. 

The class under discussion (referred to as class C), though restricted, is large 
enough to cover many interesting theorems in elementary geometry. The method 
we use as proposed in [5,6,11] for confirming theorems in C consists of the fol- 
lowing three steps. 

Step A. Let fl = fi- Do the following substitution for z = 2, . . . , r: 



fi — fi (A^) — /i|xi=/i*.--- ,Xi-l=/*_i ~ /l I • ■ • J fi-l)- 

Step B. Substitute all Xi = f* into g: 



5* =5*(m) =5lxi=/i*.....xr=/* - ,/*)■ 

Let h be the numerator of g* . 

Step C. Apply a term-rewriting system with suitably chosen rewrite rules to h. 
If h is rewritten to 0, then the theorem is true under the non-degeneracy 
conditions fy 0 for 1 < z < r. If /z does not evaluate to 0 when the 
(pseudo-)coordinate rules are applied, then the theorem is false under the con- 
ditions di fy 0. 

It is clear that in the above method both Clifford algebraic computing and 
term-rewriting are required. The former can be done in a computer algebra 
system, and the latter has been discussed in detail in the previous sections. 

The entire method has been implemented in Maple V for algebraic computing 
and in Objective Cam I for term-rewriting with an interface between the two 
systems. Here we do not enter into the implementation details (see [6] and a 
forthcoming paper by us). 



8 Application Examples 

In this section we present some examples to illustrate the application of our 
rewriting techniques and system to 3D geometric problems. 

For any two vectors a and b, their cross product is defined as 

a X b = (a A b)"^. 

Our first example shows how the rewriting procedure works by proving the well- 
known Jacobi identity 



a X (b X c) -I- b X (c X a) -I- c X (a X b) = 0 
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for arbitrary 3D vectors a, b, c. 

Example 1. Prove that 

a A (b A c)'" + b A (c A a)'" + c A (a A b)'" = 0 
holds for any 3D vectors a, b, c. 

Our system is able to produce a trace of the rewriting steps. We give below 
some of them; the others similar or unessential for understanding the proof are 
omitted. 



rewriting 

-> 

rewriting 

-> 

rewriting 

-> 

rewriting 

-> 



(b~c) ~ 

b. c~ done. 

a~ (b . c~) 

(a.b)*c~+(-l)*((a. c)*b~) 

(c~a) ~ 

c . a~ done . 

b~ (c . a~) 

(b . c) *a~+(-l) * ( (b. a)*c~) 



done . 



done . 



rewriting b.a 

-> (l)*(a.b) done. 

(. . .) 

rewriting (a.b)*c~+(-l)*(a.b)*c~ 
-> 0*c~ done. 

(. . .) 



0 

(S+U) Rewriting Time = 0.04s 
(S+U) Simplification Time = 0.00s 
(S+U) Communication Time = 0.01s 

The next two examples are taken from [7]. The proofs produced here are 
somewhat simpler than those given by Li therein. 

Example 2. Let a plane intersect the sides AB,AC,CD,DB of an arbitrary 
tetrahedron ABCD at four points M, N, E, F respectively such that MNEF is 
a parallelogram. Then the center of EJMNEF lies on the line connecting the 
midpoints of AD and BC. 
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A 



B 




E 



D 



C 

Let D,E,F,M be free points. Without loss of generality, we can take M 
as the origin: M = 0. The other dependent points may be constructed and 
represented as follows 



where a scalar parameter u is introduced, and the expressions of the constructed 
points have already been substituted into the expression of the point being con- 
structed (see step A of the method in the preceding section) . The exact meanings 
of on_line, int and other predicates, which are not really needed for understand- 
ing the examples in this section, can be found in [6,11]. The conclusion of the 
theorem to be proved is 



g = {E - A- D) /\{B + C - A- D) = 0. I col(midp(M,L;), midp(A,D), 



Substituting the expressions of the dependents into g, one finds that the resulting 
expression may be easily rewritten to 0. Therefore, the theorem is proved to be 
true under the non-degeneracy condition m yf 0 (i.e., C does not coincide with E). 

Example 3. Let ABCD be an arbitrary tetrahedron, and a plane which is 
parallel to both AB and CD cut the tetrahedron into two parts, intersecting 
the sides AD, AC, BC, BD at points P, Q, R, S respectively. Let the ratio of the 



N = parg{0,E,E) = E-F, 

C = on Jine(H, E) = uD -|- (1 — u)E, 



1 — u 

A = intp(C, N, 0,E,D) = D + F, 

u 

B = int(A, 0, D, F) = uD -I- (1 — u)F, 



midp)^,!!!)) 
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distances from the plane to AB and CD respectively be r > 0. Determine the 
ratio s of the signed volume of the two parts. 

A 



D 




Since the plane is parallel to AB and CD, it is easy to see that 

\AQ\ \AP\ \BR\ 

\QC\ ~ \PD\ ~ \RC\ ~ |5D| 

Let Q be located at the origin, i.e. Q = 0, and A,B,P be free points. Take C 
on line AQ such that \AQ\ : \QC\ = r; then 

c = -T 

r 

Similarly, take R on BC such that \BR\ : \RC\ = r, D on AP such that \AP\ : 
\PD\ = r, and S on BD such that : liSIll = r. We have 

„ B + rC ^ (r + l)P-A ^ B + rD 

R = ^ — : — I D = , b 



■ 



r + 1 



The points P,Q,R,S constructed above lie necessarily in the same plane. The 
non-degeneracy conditions are r yf 0 and r — 1, which are already satisfied by 

the hypothesis. Let 

voI(Pi,P2,P3, A) = {P2 - Pi) A (P3 - Pi) A (P4 - Pi), 

whose magnitude is six times the volume of the tetrahedron P1P2P3P4. Then 
the ratio we wanted to find is 



s = — 



vol(A, P, Q, R) + vol(A, P, P, S') vol(A, P, B, S) 
vol(P, P, g, P) -k vol(P, P, P, S) -k vol(P, Q, C, P) ■ 



Simple substitution of the expressions of the dependent points with rewriting 
yields 

r^(r -I- 3) 



s = 



3r-h 1 
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The following example is the one that takes the largest amount of proving 
time among 20 examples given in [2]. 

Example 4. Let M and N be two opposite vertices of an arbitrary paral- 
lelepiped. Then the diagonals of the hexagon formed by the mid-points of the 
edges that do not pass through M or N are concurrent. 



Let M be located at the origin (i.e., M = 0) and the vertices A,B,C be 
free points, and denote the six mid-points of the edges by Mi, . . . , Mq as in the 
above figure. Obviously, we have 



It is easy to verify that 

(Ml - Ml) A (M2 - Ms) A (M3 - Me) = 0, 

so the three diagonals M1M4, M2M5, M3M6 are coplanar. Let M2M5 and M^Mq 
meet at point H; then 



This may be done easily by substituting the above expressions of Mi,M4,iL 
into g (and rewriting the result to 0). As a side effect, we can see from the above 
proof that the intersection point of the diagonals is actually the centroid of the 
parallelepiped. 

Term-rewriting is not heavily used for the above examples, while proving the 
following theorem requires quite extensive rewriting to simplify the conclusion- 
expression to 0. 




N 



M 



C 




H = intp(M2,Ms,M3,M6,0) 



A + B + C 



2 



The conclusion we need to prove is 



g = {Mi-H)A (Ml - Ml) = 0. 
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Example 5. Let ABCD be an arbitrary tetrahedron and M be the intersection 
point of the line CD and the bisector of the dihedral angle formed by the two 
planes ABC and ABD. Denote the altitudes of AABC and AABD on the base 
AB by he and ho respectively. Then he '■ ho = \CM\ : \DM\. 

A 




Without loss of generality, we take A as the origin; i.e., A = 0. Let B, C, M 
be free points. Note that the normal vectors of the planes ABC and ABM are 
{B A C)^ and {B A M)^ , respectively. Let the normal vector of the plane ABD 
be denoted by n. Then n is the symmetrical vector of {B A C)^ with respect to 
{B A M)'^ . It is thus easy to represent n in terms oi B,C,M as follows 

n = 2{BA Cr ■ {B A M)~{B A M)~ -{BA M)~ ■ {B A M)~{B A C)~ . 

The point D is the intersection of the line CM and the plane passing through 0 
with normal vector n, so 



n • (M A C) 
n • (M - C) ■ 



The altitudes he and ho can be easily represented: 



he = C-C- 



{B ■ C)2 
BB ’ 



ho = D ■ D — 



{B ■ Df 
B ■ B 



The conclusion of the theorem is 



g = ho{M -C)-{M -C)- he{M - D) ■ {M - D) = 0. 

It is proved by substituting the expressions of n, D, he, ho into g and rewriting 
the resulting expression to 0. 

As the ratio of the areas of AABC and AABD is the same as he '■ ho, the 
theorem stated as Example 4 in [13] may be considered as a corollary of the 
above theorem. With the following conclusion-equation 
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g* = {B /\D) ■ {B A D){M -C)-{M-C)-{BAC)-{BA C)){M - D)- 

{M-D)=0, 

we have proved the corollary directly with less difficulty. 

An essential matrix is a 3 x 3 matrix that can be expressed as a skew- 
symmetrical matrix postmultiplied by a rotation matrix. It plays a central role 
in two- view estimation as shown in the computer vision literature [9]. A rank 
two matrix E is an essential matrix if and only if the equality (1) below holds. 
There are several other properties about the essential matrix; we shall prove one 
of them in the following example. Let 



E = (ei G2 ea) 



be any 3x3 matrix, where is the Ah column vector for each i. Assume that 
&i -Gj yf 0 for all i,j. For any matrix M, let AE^ and tr(M) denote the transposed 
matrix and trace of M respectively. 

Example 6. If 



EE^E = -tr{EE^)E, 



( 1 ) 



then 



ea ei 

g — 1 

ei 02 e2 ■ ea 



62 

ea • ei 



= 0 . 



Let t = tr{EE'^); then 



( 2 ) 



t = ei • ei -|- e2 • e2 -l- ea • ea. 

The equality (1) is thus equivalent to the following set of conditions 

hi = 2 (ei • ei ei -I- 02 • ei 02 -I- ea • ei 03) - tei = 0 , 

/12 = 2 (ei • 02 ei -h 02 ■ 02 02 -I- 03 • 02 03) — t02 = 0 , 

It -3 = 2 (ei • ea ei -I- 02 • 03 02 -I- 03 • 03 03) — tea = 0 . 

Therefore, proving that (1) implies (2) is reduced to proving that 

[hi = 0 , /i2 = 0 , ha = 0 ] 9 = 0 - ( 3 ) 

However, this cannot be done with a straightforward application of our method. 
For none of the e^ can be represented in terms of the other vectors, nor can 
the hypothesis-expressions be easily triangularized. Even using coordinates, it is 
also not easy to prove (3). We have tried Wu’s method without success, yet a 
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machine proof can be obtained by using Grobner bases only with respect to the 
total degree term ordering. 

Fortunately, we are able to give a simple proof of (3) with two rewriting steps: 
Let g* be the numerator of g. The first step eliminates the term ei • 63 02 ■ 03 63 
from g* using hi = 0, and the second step of generalized rewriting reduces the 
resulting expression to 0 using /i2 ■ 63 = 0. More precisely, we have 

g* — > g* -Q2- 63 hi — > g* -62 ■ 63 hi - h2- 63 ei — ^ 0, 

so (3) is proved. Moreover, we have seen that the hypothesis /13 = 0 is not needed 
for proving the conclusion (2). 

Although the above proof was produced on computer not automatically, we 
would be able to find such a proof automatically when the generalized rewriting 
technique is appropriately incorporated into our program. A systematic investi- 
gation on generalized rewriting techniques and their mechanization, presenting 
machine-generated proofs, and interpreting Clifford algebraic non-degeneracy 
conditions geometrically all remain for future research. 



Acknowledgments 

This work has been supported by CEC under Reactive LTR Project 21914 (CU- 
MULI) and by CNRS/CAS under a cooperation project between LEIBNIZ and 
CICA. 



References 

1 . Ablamowicz, R., Lounesto, P., Parra, J. M.: Clifford algebras with numeric and 
symbolic computations. Birkhauser, Boston (1996). 131 

2. Chou, S.-C., Gao, X.-S., Zhang, J.-Z.: Automated production of traditional proofs 
for theorems in Euclidian geometry - II. The volume method. Tech. Rep. WSUCS- 
92-5, Department of Computer Science, Wichita State University, USA (1992). 
152 

3. Chou, S.-C., Gao, X.-S., Zhang, J.-Z.: Machine proofs in geometry. World Scientific, 
Singapore (1995). 130 

4. Dershowitz, N., Jouannaud, J.-P.: Rewrite systems. In: Handbook of theoretical 
computer science, vol. B (J. van Leeuwen, ed.), Elsevier, Amsterdam, pp. 243-320 
(1990). 138 

5. Fevre, S., Wang, D.: Proving geometric theorems using Glifford algebra and rewrite 
rules. In: Proc. CADE-15 (Lindau, Germany, July 5-10, 1998), LNAI 1421 , 
Springer, Berlin Heidelberg, pp. 17-32. 130, 131, 146, 148 

6. Fevre, S., Wang, D.: Gombining algebraic computing and term-rewriting for geom- 
etry theorem proving. In: Proc. AISG ’98 (Plattsburgh, USA, September 16-18, 
1998), LNAI 1476 , Springer, Berlin Heidelberg, pp. 145-156. 130, 146, 147, 148, 
150 

7. Li, H.: Vectorial equations-solving for mechanical geometry theorem proving. J. Au- 
tomat. Reason, (to appear). 130, 146, 147, 149 



156 Thierry Boy de la Tour et al. 



8. Li, H., Cheng, M.-t.: Proving theorems in elementary geometry with Clifford alge- 
braic method. Adv. Math. (Beijing) 26 : 357-371 (1997). 130, 146 

9. Maybank, S. J.: Applications of algebraic geometry to computer vision. In: Compu- 
tational algebraic geometry (F. Eyssette and A. Galligo, eds.), Birkhauser, Boston, 
pp. 185-194 (1993). 154 

10. Wang, D.: Geometry machines: From AI to SMC. In: Proc. AISMC-3 (Steyr, Aus- 
tria, September 23-25, 1996), LNCS 1138 , Springer, Berlin Heidelberg, pp. 213- 
239. 130 

11. Wang, D.: Clifford algebraic calculus for geometric reasoning with application to 
computer vision. In: Automated deduction in geometry (D. Wang, ed.), LNAI 
1360 , Springer, Berlin Heidelberg, pp. 115-140 (1997). 130, 146, 147, 148, 150 

12. Wu, W.-t.: Mechanical theorem proving in geometries: Basic principles. Springer, 
Wien New York (1994). 130 

13. Yang, H., Zhang, S., Feng, G.: Clifford algebra and mechanical geometry theo- 
rem proving. In: Proc. 3rd ASCM (Lanzhou, China, August 6-8, 1998), Lanzhou 
Univ. Press, Lanzhou, pp. 49-63. 130, 147, 153 



Some Applications of Clifford Algebra to 
Geometries 
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Abstract. This paper focuses on a Clifford algebra model for geometric 
computations in 2D and 3D geometries. The model integrates symbolic 
representation of geometric entities, such as points, lines, planes, circles 
and spheres, with that of geometric constraints such as angles and dis- 
tances, and is appropriate for both symbolic and numeric computations. 
Details on how to apply this model are provided and examples are given 
to illustrate the application. 



1 Background and Notations 

Applications of Clifford algebra to geometries were originated in the work of 
Grassmann and Hamilton in the 19th century [23]. For nearly 150 years, there 
have been various applications of Clifford algebra to geometries, in the names of 
Grassmann algebra, Grassmann-Cayley algebra, quaternions, dual quaternions, 
Dirac algebra, Pauli algebra, spinor representation, Clifford algebra, dual vector 
algebra, geometric algebra, etc. In the language of geometric algebra formulated 
by Hestenes [16], for nD Euclidean geometry, there are four typical Clifford 
algebra models: 

(1) The Clifford model Qn- Points and directions of the nD Euclidean space 
are both represented by vectors in 7?,", and these vectors generate the Clifford 
algebra C/„ [7,8,16,17]. 

The Clifford model has it origin in Grassmann’s work on geometric com- 
puting [19,34], and is completed by Clifford who combined Grassmann’s work 
with Hamilton’s quaternions [23]. As a result, vector algebra and quaternions 
are naturally included in this model. The model is very convenient for geomet- 
ric computations involving Euclidean transformations of points and orthogonal 
transformations of directions. 

(2) The Grassmann model Gn+i- The nD affine Euclidean space is embedded 

as a hyperplane of with unit distance from the origin. Points are repre- 
sented by vectors in while directions are represented by vectors in 72.". 

The space 72”+^ generates the Clifford algebra Gn+i [18,19,34]. 

The Grassmann model was first proposed by Grassmann [34] . In coordinate 
form, this model gives the homogeneous coordinates representation of points in 
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projective space. This model is very efficient in geometric computations involv- 
ing projective transformations, and has been extensively used in the study of 
computer vision from the geometric viewpoint [1,2,3,4,9,10,25,36]. 

(3) The degenerate model Qn,o,i- The Clifford algebra Qn,o,i is generated by 

vectors in 7?.” © (cq), where vector cq is null and is perpendicular to 7^"; the nD 
Euclidean space 7?." as a subspace of 7?."'©(eo) is imbedded in the Clifford algebra 
Gn, 0.1 by the mapping x I + CqAx, for x € 7?.” [5,6,20,21,24,37,38]. This model 

usually occurs in the form of dual quaternions and dual vector algebra. 

The degenerate model was first proposed by Clifford in the form of dual 
quaternions, and further developed by Study [37], Blaschke [5] and others. Yang 
and Freudenstein [38] applied another version of this model, the dual vector 
algebra, in mechanism analysis. This model is an important computing tool for 
Euclidean transformations of straight lines and planes. 

(4) The Wachter model Gn+i.i- The nD Euclidean space 7^” as a subspace 

of the Minkowski space is imbedded in the null cone of the Minkowski 

space through the mapping x fo+x+{x'^ /2)f, where /, /q are two null vectors 
orthogonal to TZ^. Gn+i,i is the Clifford algebra generated by the Minkowski 
space. 

The Wachter model was first found in the work of Wachter (1792-1817) [35], 
a student of Gauss. It is closely connected with distance geometry [12,13,35], 
conformal geometry [14] and the geometry of spheres [31,32]. Various applica- 
tions in engineering, computer vision and molecular structures can be found in 
the literature [12,13,14,31,32]. 

In a recent series of papers [22,27,28,29,30] , Li, Hestenes and Rockwood estab- 
lished a universal Clifford algebra model for conformal geometries of Euclidean, 
spherical and double-hyperbolic spaces. It is shown that the three geometric 
spaces can be obtained from the null cone of the same Minkowski space through 
three kinds of projective splits [18], and all the important analytical models for 
hyperbolic geometry can be obtained by simply changing the viewpoint of a 
projective split. The Wachter model is part of the universal model in the case of 
Euclidean geometry. 

In this paper, the unified theory of the Wachter model will be applied to 2D 
and 3D Euclidean geometries, which are the most commonly used ones in ap- 
plication. Details of algebraic representations and manipulations of geometric 
entities and constraints will be presented in various lists and examples. The 
present paper intends to show that, the Wachter model is capable of preserving 
the advantages of the other three models while making the representations and 
computations of distances, lines, planes, circles and spheres more easily. 

This paper is organized as follows: an introduction of the Wachter model 
in 2D case is given in Section 2. A discussion of the connections of the Wachter 
model with other models is given in Section 3. The Wachter model in 3D ge- 
ometry is presented in Section 4. Section 5 presents some applications of the 
Wachter model in geometries. 
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The definition and basic operators of Clifford algebra can be found in various 
literature, for example [16,17,22,26]. To facilitate the readers, we present below 
a brief explanation of the definition and basic operators of the algebra. 

A real Clijford algebra generated by an n-dimensional real inner product 
space V" of signature^ {p,q,r), is the quentient algebra of the tensor algebra 
0(V”) by the two-sided ideal generated by elements of the form x ^ x — x ■ x, 
where a; € V" and the dot is the inner product in V". The Clifford algebra 
is denoted by ^(V"), or Gp,q,r when we stress the signature of the space. An 
element of a Clifford algebra is called a multivector, and the multiplication of 
the algebra induced from the tensor product is called the geometric product. 

The inner product of the space V" can be extended to the whole of ^(V"'). 
The geometric product also induces a product, called the outer product, in the 
algebra. In fact ^(V") taken as a vector space, together with the outer product, 
form a Grassmann algebra. The outer product is denoted by “A” . There is an- 
other important product in Clifford algebra, called the cross product. The cross 
product of two miiltivectors x,y is xxy = (xy—yx) /2, where the juxtaposition of 
multivectors represents the geometric product. We follow the notation x'^ = xx 
for a multivector x, and call it the square of x. 

For two multivectors x, y, if xy = yx = 1, we say x, y are both invertible and 
write X = y~^, y = x~^ . Not all multivectors are invertible. For two multivectors 
x,y, if xy~^ = y~^x, we write xy~^ = y~^x = x/y. 

A fundamental concept in Clifford algebra is r-blade (blade of grade r). 
Let ai, . . . , Or be vectors of V", then oi A • • • A is called an r-blade if it is 
nonzero. The square of a blade is a scalar, which equals the inner product of 
the blade with itself. The magnitude of a blade x is defined as |a;| = \/\x^\. 
The reverse of an r-blade x is defined as x^ = (— The main anti- 
automorphism of an r-blade a; is x* = (— l)’'x. The latter two concepts can be 
extended to any multivector by linearity. 

Let X be an invertible blade. The projection of a multivector y onto x is 
defined by Px{y) = {y ■ x)x~^ . The dual of a multivector y with respect to x is 
defined by yx^ . In particular, when x is an n-blade, called a pseudoscalar, the 
dual is denoted as y'^ . The meet of two multivectors y, z is defined as y\/ z = y'^ -z. 

The set of all r-blades where r is even, together with TZ, generates a vec- 
tor subspace ^+(V") of ^(V"). It is closed under the geometric product, and 
therefore is a subalgebra of 5(V"), called the even subalgebra. 

Let ai, . . . ,Or be invertible vectors of V". Then ai • • • is called a versor. 
When r is even, oi • • • is called a spinor. The adjugation of a multivector y by 
a versor x is defined as Ad*{y) = x*~^yx. When x is a spinor, since Ad%{y) = 
x~^yx, we can use a different symbol Adx{y) to represent Ad*{y). 

^ The signature of a real inner product space is a triplet of nonnegative integers whose 
sum equals the dimension of the space; the first (second, third) integer equals the 
dimension of a maximal subspace where every vector x satisfies x-x>0(<0, =0). 
A real inner product space with signature (p,q,r) is often denoted by 77^’'^’^; when 
r = 0 it is denoted by 77^’'^; when g = r = 0 it is denoted by 77^; when p = r = Q it 
is denoted by TZ~'^ . 
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2 The Wachter Model for the Plane 



2.1 Basic Representations 



Consider the Minkowski space . It contains the plane as a subspace. The 
orthogonal complement of TZ? in the Minkowski space is denoted as TZ}'^, which 
is a Minkowski plane. Let (/, /o) be a hyperbolic pair of the Minkowski plane, 
i.e., 

f = /o = 0, / • /o = -1. (1) 

The following mapping 



Ai-^A — fo + A+ 



(2) 



maps a vector A G TZ^ to a null vector A in TZ^’^, i.e., = 0. A fundamental 

property of this mapping is that the mapping, being one-to-one and onto the set 



{a; S TZ^’^\x'^ = 0, x ■ f = — 1}, 



(3) 



satisfies 



A-b = 



\A-B\^ 

2 



(4) 



for A,Bg TZ^. The set (3) together with its connection with the plane (2), is 
called the Wachter model of the plane. (4) says that in the Wachter model, the 
square of the distance between two points on the plane can be represented by 
the inner product of their null- vector representations. This property decides the 
importance of the Wachter model in the study of distance geometry [12,35]. 

Another main application of the Wachter model is in the conformal geometry 
of the plane, i.e., in the study of the transformations that always map a line or 
circle to a line or circle. In the Wachter model, a line passing through two points 
A,B has the representation f A A A B; a circle passing through three points 
A, B, C has the representation A A B AC . The representation is unique up to a 
nonzero scale. 

To derive the above representations of lines and circles, we need the Ptolemy’s 
Theorem in plane geometry, which says that four points Ai, i = 1, ... ,4, are on 
the same line or circle if and only if det(|Ai — AjIAaxa = 0. Since 



{A\ A A 2 A A3 A A/^'^ — det(Ai • A^-)4 x 4 — Y0 det(| Ai — A^- p)4x4, (5) 

the four points A^, i = 1, ... ,4, are on the same line or circle if and only if 
Ai A A2 A A3 A A4 = 0. Therefore we can use A A B A C to represent the line 
passing through A, B, C if they are collinear, or the circle passing through them 
otherwise. A point D is on the line or circle if and only iiAABACAD = 0. 

Next we show that the vector / is always on the blade A A B A C if points 
A,B,C are collinear. Assume that A ^ B, then C = AA -|- (1 — X)B for some 
X G TZ. As a. result, 

fAAABAC=fA (/o -|- A) A (/o + B) A (/o -I- AA -|- (1 — X)B) = 0. (6) 
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This conclusion guarantees that we can use f f\ A f\ B to represent line AB, as 
it differs from A A B A C only by a nonzero scale. 

Indeed, the null vector / represents the point at infinity of the plane (not 
the concept in projective geometry), and is on every line of the plane. So line 
f aAaB can also be explained as the circle passing through the point at infinity 
and points A, B. The null vector /o represents the origin of the plane, as it is 
the representation of the zero vector of the plane. 

For a circle on the plane, the usual way to describe it is through its center and 
radius; for a line, the usually way is through its normal direction and distance 
from the origin. In the Wachter model, a circle with center O and radius p is 
represented by (O — (p^/2)/)'^; a line with unit normal n and n-distance 5 (the 
signed distance from the origin to the line in the direction of n) is represented 
by {n + 5f)~. 

These representations can be obtained as follows: let A be a point on the 
plane. First, since A 






( 7 ) 



is on the circle with center O and radius p, i.e., |A — 0| = p, if and only if 
AA(0— (p^/2)/)'" = 0, therefore the circle can be represented by (O— (p^/2)/)'". 
Second, since A 



A A (n + = (A • (n + = {A ■ n — <5)"^, (8) 



is on the line with unit normal n and n-distance S, i.e., A • n = (5, if and only if 
A A (n -I- Sf)'^ = 0, therefore the line can be represented by (n -|- Sf)^ . 

The representations of circles and lines through their geometric data, together 
with the previous representations through their points, enable us to derive the 
center and radius of a circle from its three points, and the normal and distance 
to the origin of a line from its two points. The formulas are provided in the 
following table, which is on representing basic geometric entities and constraints 
in the 2D geometry within the Wachter model. 



Geometric entities and constraints 


Representations 


The origin 


fo 


The point at infinity 


f 


The square of the distance between 
points A, B 


-2A-B 



Some Applications of Clifford Algebra to Geometries 161 



Line AB 


f aAab 


Line where 1 is the direction 


f aAai 


The line with unit normal n and signed 
distance 5 away from the origin 


(n + Sf)^ 


The ratio of collinear line segments AB 
and CD 


f aAab 

f AC AD 


The signed distance from line {A^l) to 
point B 


{f A A Al A B)~ 


The signed area of triangle ABC 


{f aAab AC)~ 


The inner product of vectors A — B and 
C-D 


U AAAB)-{f AC AD) 


The intersection of lines AB and CD 


{f aAaB AC)~D 

{fAAABA{C-D))^ 

{f aAaB AD)~C 

{f aAaB A{C - D))~ 


The circle passing through points 
A,B,C 


Aabac 


The circle with center O and radius p 


(o-y/r 


The center of circle ABC 


(Aabac)^/ 

-2(i AB AC) ■ if aAaB AC) 


The square of the radius of circle ABC 


{Aabac)^ 

{jaAabac)^ 


The square of the distance between 
point D and the center of circle ABC 


Aabac AD 

p^-2 ^ ^ 

f A Aabac 


The signed distance from line (n + 5f)^ 
to the center of the circle ABC 


uaAab AC 

s , , , 

f A Aabac 


The intersection of circles ABC and 
DBF 


{AABAC)y{DAEAF) 
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The intersection of line AB and circle 
CDE 


if A A AB)\J {C AD AE) 


The point of tangency of line AB and 
circle CDE 


{{C AD AE)\j{f AAAB))-{f aAaB) 


The point of tangency of circles ABC 
and DEE 


{{daeaf)\/{Aabac))-{Aabac) 


The tangent line of circle ABC at 
point D 


f A{D ■ {AaB AC)) 


The constraint that points A,B,C are 
collinear 


f aAaB AC = 0 


The constraint that points A,B,C,D 
are on the same circle 


AaB AC AD = 0 


The constraint that lines AB,CD are 
parallel 


f aAaB AC = f aAaB AD 


The constraint that lines AB,CD are 
perpendicular 


(/ A i A B) • (/ A C A D) = 0 


The constraint that lines AB^CD, EE 
are concurrent 


if A A AB AC) ■ {f AD AE AF) 

= {f A A A B A b) ■ U AC A E A F) 



2.2 Conformal Transformations 

A basic theorem in 2D conformal geometry with the Wachter model is that any 
conformal transformation can be realized by the adjugation of a versor in G 3 . 1 , 
and any versor realizes a conformal transformation through its adjugation. Two 
versors induce the same conformal transformation if and only if they are the 
same up to a nonzero scalar or pseudoscalar factor [30] . In this section we provide 
Clifford algebra representations for several typical conformal transformations. 

Let M be an algebraic entity in the Wachter model representing a geometric 
object on the plane, i.e., a point, line or circle. 

(1) The reflection oi M with respect to line AB is represented hy Ad* , 

as f h A f\ B represents the line. 

(2) The inversion of M with respect to circle ABC is Ad*^^^^^{M). 

(3) The rotation of M, centered at the origin and with signed angle 9 relative 
to the orientation of the plane, is represented by Ad^ei 2 / 2 {M), where I 2 is the 
unit pseudoscalar representing the oriented plane [16]. 

(4) The translation of M along vector I is represented by Adi_f^i/ 2 {M). 
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(5) The dilation of M, centered at the origin aird with positive ratio e"^, is 
represented by Adg-A/A/ 0 / 2 (M). For example, when M = A where A is a point, 
then Ad^-xfAfo/^{M) represeirts poiirt e^A. 

(6) The dilation of M, centered at the origiir and with iregative ratio — e'^', is 

represented by Adf^fA.fo)e-^f''fo/ 2 {M). For example, wheir M = A where A is a 
point, then (M) represents point — e'^'A. 

(7) The special conformal transformation of a point X along vector I is defined 
to be point 1/(1/A' — l/l). For a geometric entity M, its image under this 
transformatioir is represeirted by Adi^fgAi/ 2 {M). 

2.3 Geometric Relations between Two Entities 

The geometric eirtities are assumed to be poiirts, hires and circles. The following 
results are direct applications of the theories proposed in [27]. 

1) . Point D is inside circle ABC if and only if — ^ , > 0. This can 

fAAABAC 

be obtained by comparing the radius of the circle with the distance between D 
and the center of the circle. 

2) . For circles ABC, DEF, they intersect, are tangent, or do not intersect if 
and only if 

{{AaB AC)y {D AE AF)f >0, =0, <0, 

respectively. 

3) . When circles ABC, DEE do not intersect, there is a pair of points that 
are inversive to each other with respect to both circles. The two points, called 
Poncelet points, correspond to the two null vectors 

\AaB AC\\A2\{AaB AC)^ ±\D AE AF\A2 • (A ABAC), 

where A 2 = {A A B A C) V {D A E A F). 

4) . For line AB and circle CDE, they intersect, are tangent or do not intersect 
if and only if 

((/ A AAB) V (CAB AB))2 > 0, =0, <0, 



respectively. 

5). For two intersecting circles ABC and DEE, where A,B,C and D,E,F 
are both ordered anticlockwise, the inner product of the two outward normal 
vectors of radius length of the circles at any point of the intersection equals 

^ _ {AaB AC) -{DAE AF) 

~ {f A A A B AC) ■ {f A D A E A F) 

Therefore t represents a geometric invariairt. We say the two circles are near, 
orthogonal, or far, ift>0, = 0, <0, respectively. 
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Fig. 1. The geometric relations between two circles. 



2.4 Geometric Relations among Three Entities 

There is an algebraic expression characterizing most of the relations among three 
geometric entities. Let a,b,c be vectors in and let 

s(a, b, c) = (a ■ b)(b ■ c)(c ■ a). (9) 

When a, &, c are null or of positive square, the sign of the scalar s(a, b, c) is 
independent of the length and sign of each vector, and therefore is a geomet- 
ric invariant. Below we discuss some inequality constraints of three geometric 
entities using this invariant. The proofs are omitted because they are direct 
applications of the tables in Section 2.1. 

1. For points A, B, they are on different sides of line CD if and only if s(A, B, 
if AC A D)'^) > 0, i.e., 

if A A A C A D) ■ (f A B A C A D) < 0. 

2. For points A, B, they are on different sides of circle CDE if and only if 
s{A, B,{C AD A E)"") > 0, i.e., 

{AaC AD AE) -{B AC AD AE) <Q. 

3. For point A and parallel lines BC,DE, A is between the two lines if and 
only if 

s(i, if ABA C)~, if AD A E)~) < 0. 

4. For point A and intersecting lines BC, DE, A is in one of the two acute- 
angled regions of the plane divided by the lines if and only if 

s(i, if ABA C)~, if AD A E)~) < 0. 
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5. For line AB, circle CDE whose center does not lie on AB, and point F 
inside the circle, F and the center of the circle are on the same side of the 
line if and only if 

s(F, (/ A i A B)^, {CADA E)~) > 0. 

6. For line AB, circle CDE whose center does not lie on AB, and point F 
outside the circle, F and the center of the circle are on the same side of the 
line if and only if 

s{F, if aAa B)~, iCADA E)~) < 0. 

7. For far circles ABC, DEF, point X is inside or outside both of them if and 
only if 

s(A', (AaBA Cr, (DAE A F)^) < 0. 

8. For near circles ABC, DEF, point X is inside or outside both of them if 
and only if 

six, (i A B A Cr, iDAEA F)'^) > 0. 

9. For intersecting lines AB, CD, the center of circle EFG is in one of the two 
acute-angled regions of the plane divided by the lines if and only if 

siiE AF ACr,if AAAB)~,if AC AD)~) < 0. 

10. For parallel lines AB,CD, the center of circle EFG is between the lines if 
and only if 

siiE A F A Cy ,if A A A B)~ ,if A C A D)~) < 0. 

11. For far circles ABC, DEF, their centers are on the same side of line GH if 
and only if 

siif AG A Hy, iAABA C)'^, iDAEA F)'^) < 0. 

12. For near circles ABC, DEF, their centers are on the same side of line GH 
if and only if 

siif A G A H)'^, iAABA C)'^, iDAEA F)^) > 0. 

13. For three circles ABC, DEF, GH I, they are all far from each other, or two 
are far and both near to the third, if and only if 

s((i ABA Cr, iDAEA F)^ , (G A 77 A ID < 0. 



166 Hongbo Li 



3 Connections with Other Clifford Algebra Models 



Having obtained some feeling about the Wachter model, we now discuss the 
connections and differences of this model with other Clifford algebra models: 
the Clifford, Grassmann and degenerate models. 

In what follows, within the framework of 2D geometry, we present in each 
model representations of typical geometric entities and constraints. Some typical 
geometric entities are: points, directions, angles, areas, distances, ratios, line 
segments, triangles, circles. Some typical geometric constraints are: collinearity 
of three points; concurrence of three lines; parallelism (perpendicularity) of two 
lines; co-circularity of four points. 

In the Clifford model, a point A is represented by the vector from the origin, 
which is on the plane, to the point. A direction is represented by a vector. Also, 



an angle as a scalar, is represented by the inner product of two unit vectors 
along the sides of the angle; 

the signed area of triangle ABC is {{A — B) A {A — C))""/2; 

the square of the distance between points A, B is (A — B)^; 

the signed ratio of collinear line segments AB with CD is {A — B)/{C — D); 

a line passing through point A and with unit direction I is represented by 

the pair (A, Z), i.e., a point B is on the line if and only if (H — A) A Z = 0; 

the signed distance from line (A, Z) to point C is ((C — A) A Z)'"; 

, . , ,, / , / „ \ . BAm, AaZ 

the intersection of two lines (A, Z), (ij, m) is Z — to; 

Z A TO Z A TO 

a circle with center O and radius r is represented by the pair (0,r^), i.e., a 
point A is on (inside) the circle if and only if (A — O)^ — = 0 (< 0); 

the projection of a point C onto a line (A, Z) is A -|- P;(C — A) = A + ((C — 
A)-l)l- 

the reflection of point B with respect to line (A, Z) is A -|- Ad^ (H — A); 
the rotation of point A by angle 9 is Ad^e/^i^ (A); 
the translation of point A along vector Z is A -|- Z; 

the constraint that three points A, B, C are collinear is represented by (A — 
H)A(A-C) = 0; 

the constraint that two lines AB, CD are parallel is represented by {A — B)A 
(C-D) = 0; 

the constraint that two lines AB, CD are perpendicular is represented by 
{A- B) ■ {C - D) = 0. 



Since TZ"^ is a subspace of the Clifford model Q 2 is a subalgebra of the 
Clifford algebra Qz,! realizing the Wachter model. So this model is naturally 
included in the Wachter model. Another realization of the Clifford model is to 
use the fact that a point A in the Clifford model is represented by the vector 
from the origin to the point, which corresponds to the directed line segment 
/ A /o A A. The set {/ A /o A A| A S TZ?}, together with the inner product in 
and the following outer product g: 



g{f AfoAA,f AfoAB) = f A/oAAaB, 
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for A,Bg TZ^, realize the Clifford model within the Wachter model. This real- 
ization is called the theorem of conformal split [19] of Tif. 

We see that in the Clifford model, circles and lines do not have algebraic 
representations that can join algebraic manipulations as single terms. Also the 
representations of affine properties, for example the signed area, the collinear 
constraint, are not as simple as in the Wachter model. 

To describe affine properties, it is convenient to embed the plane into a 3D 
vector space, which is denoted by TZ"^ 0 (eo) , where vector eo is orthogonal to TZ"^ . 
By doing so we can represent an affine transformation with a 3 x 3 matrix. 

When we choose e§ = 1, we obtain the Grassmann model, where the plane 
is the one in TZ^ = TZ^ (cq) passing through the end of vector Cq. A point in 
the plane corresponds to the vector from the origin to the point. A direction is 
represented by a vector in TZ^. Moreover, 

— line AB is represented by A A B; 

— the signed area of triangle ABC is {A A B A C)~ /2] 

— the signed ratio of collinear line segments AB with CD is A A B/C A D; 

— the constraint that three points A, B, C are collinear is represented by A A 
BAC = Q] 

— the constraint that two lines AB, CD are parallel is represented by A A C A 
D = BAC AD; 

— the constraint that three lines AB, CD, EF are concurrent is represented by 
{A ABA C)~{D AEAF)~ = {AABA D)~{C AEA F)~; 

— vector a represents a point if and only if d{a) = cq • a = 1; it represents a 
direction if and only if d{a) = 0; 

~ the direction of line AB is represented by d{A A B) = cq ■ {A A B) = B — A; 

— the square of the distance between points A,B is (i9(A A B))^; 

— the distance between line AB and point C is |A A B A C|/|9(A A B)|; 

— a circle with center O and radius r is represented by the pair (0,r^); 

~ the intersection of two lines AB and CD is (A A B) V (C A D) /d{{A A B) V 
(CAB)); 

— the constraint that two lines AB, CD are perpendicular is represented by 
d{AAB)-d{C AD) = t). 

Compared with the Clifford model, the simplicity in representing affine prop- 
erties here is obvious. One drawback of the Grassmann model is that the inner 
product has geometric meaning only when restricted to vectors in the plane. 
Another drawback is the representation of a circle. This model can be realized 
within the Wachter model, as will be shown in Example 3 of the last section. 

When we choose e§ = 0, we get the degenerate model. Historically, this 
model occurred in the form of dual quaternions and dual vector algebra, and 
has important applications in 3D geometry and mechanics. In this model, a 
point A is represented by multivector 1 + eg A A, where A is the vector in TZ^ 
from the origin to the point. The origin is represented by scalar 1. Moreover, 

— the signed area of triangle ABC is ((A — B) A (A — C))'" /2; 

— the square of the distance between points A, B is (A — B)^; 
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~ the signed ratio of collinear line segments AB with CD is {A — B)/{C — D); 

— a circle with center O and radius r is represented by the pair 

~ a line passing through point A and with direction I is represented by ^ + cq A 
AM] 

— the reflection of point C with respect to line {A, 1) is (Z + cq A ^ A Z)(l — cq A 
C){1 + eo/\AM) = l + eo/\{2A-C- 2{{A - C) ■ 1)1)] 

— the rotation of a point C induced by a spinor U is Adu{l + eo A C) = 
1 + Co A AdijiC)] 

— the translation of a point C along a vector Z is (1 + Cq A Z/2)(1 + cq A C)(1 + 
eg A 1/2) = 1 + eg A (C + Z); 

— the constraint that three points A, B, C are collinear is represented by (A — 
B)A{A-C) = 0] 

— the constraint that two lines AB, CD are parallel is represented by (A — B) A 
{C-D) = Q] 

— the constraint that two lines AB, CD are perpendicular is represented by 
{A-B)-{C-D) = Q. 

We see that in many representations in this model, we have to use the rep- 
resentations of points and lines in the Clifford model, instead of the ones in the 
model itself. We also have to represent a circle with a pair of algebraic entities. 
Another drawback is that the Euclidean transformations of the plane do not have 
a uniform representation, as can be seen from the representations of reflections, 
rotations and translations in the previous list. 

One advantage of this model is that a line has an algebraic representation 
that can be used directly in algebraic manipulations, although not of a single 
term. The degenerate model is more suitable for computations on Euclidean 
transformations of lines. This model can also be realized in the Wachter model, 
as will be shown in Example 4 in the last section. 

The Clifford, Grassmann and degenerate models can all be realized within 
the Wachter model. The Wachter model is capable of preserving the advantages 
of the other three models while making the representations and computations 
of distances, lines, planes, circles and spheres more easily. We shall given more 
examples in later sections. 



4 The Wachter Model for the Space 

4.1 Points, Lines, Planes, Circles and Spheres 

Similar to the 2D case, now consider the Minkowski space 7?.^’^, which contains 
the space Ti? as a subspace. Let (/, /g) be a hyperbolic pair of the Minkowski 
plane which is the orthogonal complement of . The set {x € = 0, a; • 

/ = —!}, together with the mapping Ai— >A = /g-|-A-|- (A^/2)/, forms a model 
for the Euclidean space, called the Wachter model of the space. 

In this model, the representations of a line and a circle by their points are 
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the same as in the 2D case. A plane passing through three points A, B, C has the 
representation f AAAB AC; a sphere passing through four points A,B,C,D has 
the representation AAB AC AD. Using geometric data, a sphere with center O 
and radius p is represented by (O — (p^/2)/)'^; a plane with unit normal n 
and n-distance 6 is represented by (n + Sf)'". 

A line with direction I and n-distance 6, where n is a unit normal of the 
line and is in the plane decided by the line and the origin of the space, can be 
represented by f A fo A I + 6 f A n A 1. This is because points 5n and 6n + I are 
both on the line. A circle on plane ABC, with center O and radius p, is the 
intersection of the plane with the sphere whose center is O and whose radius 
is p, therefore can be represented by (O — {p'^/2)f)~ V {f A A A B A C). 

The following table gives more representations of typical geometric entities 
in the 3D geometry. 



Geometric entities and constraints 


Representations 


Point A 


A 


Line AB 


f aAab 


Circle ABC 


AaB AC 


Plane ABC 


f aAabac 


The plane normal to unit vector n and 
with n-distance 5 


(n + Sf)^ 


The plane passing through point A and 
normal to vector n 


{n+{A-n)f)'^ 


The plane passing through point A and 
directions 1 , m 


f A A Al Am 


The normal vector of plane ABC 


ifAAABAC)--{fAfoAAABAC)-f 


The signed distance from the origin to 
plane ABC 


{/oaAaba C)~ 
\f A A AB AC\ 


Sphere ABCD 


AaB AC AD 
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The sphere with center O and radius p 


io--fr 


The center of sphere ABCD 


{A A B AC A Off -2 {AaB ACA 
D) ■ if aAaB AC AD) 


The square of the radius of sphere 
ABCD 


(AaBAC AD)^ 

{f aAaB AC AD)‘^ 


The circle on plane ABC, with center O 
and radius p 


, P^ , , , 

AAABAC) 


The common perpendicular of non- 
coplanar lines AB and CD 


f A{A{f AB AC AD) ■ N 4 
-B{fAAACAD) -Ni) 

A{C{f A A A B A D) ■ N 4 
—D{f A A A B AC) ■ N 4 ), where 
Ni = f AffiA{A-B)A{C -b) 


The distance between two non-coplanar 
lines AB, CD 


\f aAaB AC AD\ 
\{A-B)A{C-D)\ 


The ratio of the signed area of coplanar 
triangles ABC and DEF 


f aAabac 

f AD AE AF 


The signed volume of tetrahedron 
ABCD 


if aAaB AC AD)'- 



4.2 The AfRne Representations of Points, Lines and Planes 

For a geometric problem involving only points, lines and planes, it is better to 
use fAA instead of A to represent point A. The reason is, the set {f A A\ A £ TZ^} 
is affine: 

XfAB+{l-X)fAC = fAD, 

where D = XB + (1 — A)C. This representation is called the affine representation 
of a point. The affine representation of a line (plane) is the representation used 
in the Wachter model. 

We mention only one application here. Let be a Minkowski r-blade, then 

f A Ar represents a point, line, or plane when r = 1,2, or 3. For two such 

blades where r < s, the projection of f A Aj. onto f A Ag is 

Pf^AAf ^Ar) = f APf^AMr)- (10) 

It is still a Minkowski r-blade, and represents a geometric entity. When explained 
geometrically, (10) says that the feet drawn from point A to line BC and plane 



Some Applications of Clifford Algebra to Geometries 171 



BCD are and f ^Pf^shChD^-^) respectively, and the projection 

of line AB onto plane CDE is / A ^ PfECADAE^P)- 

5 Applications 

Example 1. Simson triangle. 

Let ABC be a triangle, H be a point on the plane. Draw perpendicular lines 
from D to the three sides AB, BC, CA of triangle ABC. Let Ci, A\, B\ be the 
three feet respectively. Triangle AiB\Ci is the so-called Simson triangle. 




When A, B, C, D, Ai,Bi,Ci are understood to be null vectors, we have the 
following conclusion: 



f A Ai A Bi A Cl 



AABAC AD 

V 



( 11 ) 



where p is the radius of circle ABC. (11) can be used to compute the signed 
area of the Simson triangle. 

An immediate corollary of (11) is: when D is constrained on a circle concentric 
with circle ABC, the area of the Simson triangle is constant. In particular, if D 
is on circle ABC, then Ai,Bi,C\ are collinear, which is the classical Simson’s 
theorem. 

Now we prove (11). Point Ai is the foot drawn from point D to line BC, 
therefore, 

f AAi=Pf^BAcif AD). (12) 

A direct computation gives 



fAAi 



f A PfABAc{D) 

B + C D-{B-C){B-C) 

f 77^-^ ). 



2 



2B ■ C 



( 13 ) 
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Similarly we have 



+ ^ D-{C-A){C-A)^ 

/A 1 /A( 2 2C-A 

+ B D-{A-B){A-B) ^ 
/ 1 / ( 2 2A-B 



Now 



where 



f A Ai A B\ A Cl — 



f AA/\B AC 
4A-BB-CC ■ A^ 



B 



r = A - BB ■ CC ■ A + A- BD ■ {B - C)D ■ {C - A) 

+D-{A- B)B - CD ■ {C - A) + D ■ {A - B)D ■ {B - C)C ■ A. 

The right-hand side of (11) equals 

{A A B AC A D){f AAAB ACf 



2{AABACy 
f AAABAC 



{A A B A C A D) ■ (f A A A B A C), 



AA- BB -CC ■ A' 
therefore we only need to prove 

r ={AAB AC AD)-{f AAAB AC). 



(14) 



(15) 



(16) 



(17) 



(18) 



The only constraint among vectors /, A^ B, C, D is that they are in the same 
4D space, i.e., fAAABACAD = 0. So 



{fAAABACAD)-{fAAABACAD) = 0. (19) 

We are going to see that the equality (18) is identical to (19). This can be proved 
in various ways, for example, we can use affine coordinates of D with A, B, C 
as the affine basis, and express everything with coordinates, or we can use the 
method of straightening laws in [31], by which we can keep geometric meaning 
in the proving procedure. The following are some direct computations without 
expanding everything to inner products of vectors. 

First, we expand the left-hand side of (19) in the following way: 

{{f A A A B AC A D) ■ f) ■ {A A B AC A D) 

= -if AAAB AC) ■ {AAB AC AD) (20) 

+{f A {A A B + B A C + C A A) A D) ■ {A A B A C A D). 

From this, using 

{f AAAB AC)-{AAB AC AD) 

= -{A AB AC) - if - {A A B AC AD)) 

= {Aabac)-{{Aab + bac + caA)ad)-{Aabac)^ 



( 21 ) 
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and 

if A (A AB + BAC+CA A) AD) -{A ABACA D) 

= -{A AB + B AC +C AA) AD) ■ {f ■ {AAB AC AD)) 

= {D A{A- B) A{B- C))2 - {A A B AC) ■ {{A A B + B AC + C A A) A D), 

we obtain 

{f AAAB AC AD)'^ = {D A{A- B) A{B - C))^ + (A A B A C)^ 

-2(A A B A C) ■ {{A A B + B A C + C A A) A D). 

Second, using (21), together with 

{AaBAC)^ = -2A -BB-CC-A 

and 

{DA{A-B)A{B- C))2 = 2(A -BD-{B- C)D ■ {C - A) 

+D-{A- B)B ■CD-{C-A)+D-{A- B)D ■ {B - C)C ■ A), 

we obtain 

r - {AAB AC AD) -{f AAAB AC) 

= A{A-B)A{B- C))2 + {AABA C)^) 

-{A abac)-{{Aab + bac + caA)ad) 

= AAAB AC AD)"^. 

This ends the proof. 

Example 2. The direct kinematic problem of a Steward platform. 

This example is cited from the work of B. Mourrain [31,32,33]. Consider a 
platform attached to the ground by six extensible bars. Changing the lengths 
of the bars slightly entails a small displacement of the platform so that this 
mechanism is suited for doing precise movements of adjustment. This parallel 
robot, also called Steawrd platform or left hand, can be used as flying simulator 
or hand of a serial robot. 

The direct kinematic problem of this robot can be described as follows: let Xi, 
i = 1 , . . . , 6 be the endpoints of the bars on the ground, Zi, i = 1 , . . . , 6 be the 
points attached to the platform, which is moved by a displacement D = (R,T), 
where i? is a 3D rotation, T is a 3D parallel displacement. The problem is, given 
the lengths of the six bars \Xi — Zi \ = h, i = 1, . . . , 6, And the displacement D. 

In the Wachter model, the constraint that every Xi is transformed to Zi by 
a rigid body motion, where i = 1 , . . . , 6, can be expressed by 

u~^fu = /, i, = u~^x,u, 

where D is a spinor in ^44. The constraint \Xi — Zi\ = li is equivalent to 
Z,-X, = {U-^X,U)-X, = -^. 



( 22 ) 

(23) 

(24) 

(25) 

(26) 
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Using the above formulation and results from algebraic geometry, it can be 
proved that the number of solutions for D is at most 40 [33]. 

Example 3. The Grassmann model of within the Wachter model. 

In the Wachter model of 7?,", let /, /g be the two null vectors representing the 
point at infinity and the origin respectively. Let F = {A + fo\A G TZ^}. Then F 
is an affine hyperplane of 7?," 0 (/o). The mapping 

A^ A + fo, for AgTZ^ (27) 

is one-to-one and onto F. It maps the origin to /g. 

Define an operator d in Q{TZA 0 (/g)) as follows: 

d{x) = -f -X, 

for X e 0 (/g)). Then d{A) = 0 and d{A + fo) = 1 for A € TZ^. The 

Clifford algebra 1/(7?." 0(/g)), together with the mapping (27) and the operator d, 
constitutes a model that is identifiable with the Grassmann model of TZ^. 

There is another realization of the Grassmann model within the Wachter 
model, by means of the affine representations of points, lines, planes, etc. In the 
Wachter model, an (r — l)-dimensional plane passing through r points Ai, . . . ,Ar 
can be represented by 

/ ^ ^1 A ■ ■ ■ A Aj~ ("OS') 

= / A Ai A • • • A Ar + / A /g A {A2 — ^1) A • • • A {Aj. — Ai). 

In the set {f A x\x G Gn+1,1} we define the following outer product g: 

g{fAx,fAy) = fAxAy, 



for x,y G Gn+ 1 , 1 - We also define an operator i9 in it: 

d{f Ax) = if Afo) ■ if Ax), 

for X G Gn+1,1— TZ, and 9(A/) = 0 for A S 7?.. Then difAA) = 0 and difAA) = 1 
for A G TZ^. The Clifford algebra Gn+ 1 , 1 , together with the affine representations, 
the outer product g and the operator d, provides another realization of the 
Grassmann model. 

Example 4- The degenerate model of the space. 

The degenerate model can be derived from the affine representations of 
points, lines, planes, etc. Let Ai,...,Ar be r points in the space. Then (28) 
gives a representation of the (r — l)-dimensional plane passing through them. 
The dual of this representation with respect to /g A / gives 

(/Ail A---Ai^)(/A/g) = 

iA2 - Ai) A ■ ■ ■ AiAr - Ai) + f A Ai A ■■ ■ AAr- ^ ’ 



This is exactly the degenerate model of the space. For example, point A is 
represented by 1 0 / A line AB is represented hy B — A+ f AAAB, plane 
ABC is represented hyAAB + BAC + CAA + fAAABAC. 
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One advantage of the degenerate model is that only one null vector occurs in 
the representation, which will simplify computation when no dilation is involved. 
To compensate for this advantage, we need to redefine the action of a versor on 
a geometric entity M as follows: let 17 be a versor inducing a Euclidean transfor- 
mation. Let M' = M{f A fo). Then the image of M' under the transformation 
is 

Ad\j{M’) = U*~^MU, ( 30 ) 

where the overbar represents an outermorphism that satisfies A = A for A G TZ^, 
and f = —f- An outermorphism g in a Clifford algebra is a linear transformation 
within the Clifford algebra that satisfies 



g{ai A ... A Ur) = g{ai) A • • • A g{ar), 

for any multivectors oi, . . . , a^.. 

Now we can define a new spinor action “Ad**” as follows: 

Ad*y{M) = V**~^MV, ( 31 ) 

where V = U, the double-star symbol represents an outermorphism that satis- 
fies A** = —A for A G TZ^, and /** = /. In this way, the spinor inducing the 
rotation from direction m to direction n is m(rn + n), the spinor inducing the 
translation along vector I is 1 -I- / A Z/2. 

The geometric relationship between two geometric entities Mi, M2 is MiM|. 

Example 5 . Dual quaternions. 

Dual quaternions was first used by W. K. Clifford [6] in geometries. Later on it 
was used to solve problems in mechanics [ 5 ]. The definition of dual quaternions 
is: let Q be the set of quaternions, e be a nilpotent algebraic element which 
commutes with every quaternion. The set Q + eQ is called dual quaternions. 
The set TZ + eTZ is called dual numbers. 

As shown in [ 17 ], quaternions are naturally included in the Clifford model of 
the space: let {61,62,63} be an orthonormal basis of TZ^. Then the correspon- 
dence: i 1-^- 6i A 62, j I— > 62 A 63, fc I— > 61 A 63, gives a Clifford algebra isomorphism 
of Q and G^. 

Dual quaternions is another form of the degenerate model of the space. In 
the degenerate model, let e = / A fy, where I 3 is a unit pseudo-scalar of G3. 
Then G3 is a vector space spanned by 1 , 61 A 62, 62 A 63, 61 A 63, and eG^ is a 
vector space spanned by / A ei, / A 62, / A 63, / A fy. Obviously e commutes with 
everything in G3 ■ As a consequence, G'^if A I 3 ) = ^^ -|- eG^ is isomorphic to 
dual quaternions as Clifford algebras. 

In the degenerate model, the representations of point A, line (A, 1 ) and plane 
(A,l,m) can be written as 



1-H/aA = 1-beA-L; 

{I + f A A A !)■*■ = fy -I- eA-*- x 
I Am + f A A Al Am = I Am + e(A A I A m)-*-. 



( 32 ) 
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Here represents the dual in Q3. From this we get the dual quaternions 
representation of the space: point A is represented by 

1 + G 1 + eQoi (33) 

where Qq is the set of pure quaternions together with zero; line where I 

is a unit vector, is represented by 

+ eA“*“ X € Qo “1“ cQoi (34) 

where Qq is the set of unit pure quaternions; a plane passing through point A 
and normal to unit vector n is represented by 

■ A~^ S Qo + (35) 

In the dual quaternions model, the action of versors needs to be redefined. 
For a point or plane, the action of a versor U is Ad^ . For a line, the action is 
Adjj. More explicitly, the versor generating the reflection with respect to a plane 
with unit normal n and n-distance 6 is 

U = {n — Sf)-^ = «■'■ + C(5 G Qo + £"7^, (36) 

the image of geometric entity M , which is either a point, a line, or a plane, under 
this reflection is 

UMU, or UMU-^, or UMU. (37) 

The spinor generating the rotation from point A to point B centering at the 
origin is 

V = -A{A + B)=A^{A^+B^)€Q, (38) 

the image of geometric entity M under this rotation is 

V~^MV. (39) 

The spinor generating the translation from point A to point B is 

fj — A R-*- — d-*- 

W=l + f^^ = l + e ^ Gl + eQo, (40) 

the image of geometric entity M, which is either a point, a straight line, or a 
plane, under this translation is 

WMW, or W~^MW, or WMW. (41) 

As to the geometric relationship between two entities Mi, M2, if Mi is not 
a line but M2 is, the relationship is represented by Mi M 2 , otherwise it is repre- 
sented by MiM\. 

Example 6. Dual vector algebra. 

First let us explain what is dual vector algebra. We know that a vector algebra 
is a three dimensional vector space equipped with a inner product and a cross 



Some Applications of Clifford Algebra to Geometries 177 



product of vectors. Now let e be an algebraic nilpotent element that commutes 
with everything. The set + eV^ is called dual vectors. Dual vectors, equipped 
with the inner product and cross product from the vector algebra, form the dual 
vector algebra of V^. 

Let us start from the dual quaternions model of TZ^. We know that Qq, 
the set of pure quaternions and zero, is isomorphic to TZ~^ as vector algebra, 
under the correspondence between {f , j, k} and an orthonormal basis of TZ~^, and 
the correspondences between the inner products and the cross products in Qo 
and TZ~^ respectively. As a corollary, Qo+eQo and the dual vector algebra of TZ~^ 
are isomorphic as vector algebras. This isometry identifies the dual quaternions 
model of TZ^ with the dual vector algebra model of TZ~^: point A is represented 
by 

1 + eA; (42) 

line (A,l) is represented by 

l + eAxl, (43) 

where “x” is the cross product in vector algebra; the plane passing through 
point A and normal to vector n is represented by 

n + en ■ A. (44) 

The geometric product of two dual vectors x, y satisfies 

xy = x- y + xxy. (45) 

If we use the inner product in TZ^ for vectors, we get the dual vector algebra 
of 7^3. 

One application of dual vector algebra in geometry is the computation of two 
non-coplanar lines [37,38]. For lines (A, 1), {B, to), let n be the unit vector in the 
direction of I x m, 6 he the distance between the two lines, 9 be the angle from I 
to TO. Let P be the intersection of the common perpendicular of the two lines 
with line (A,l). Then 



(/ + eA X l){m + eB X m) = e®", 

where 

e®" = cos 9 + n sin 9, 

9 = 9 + eS, n = n + eP x n, and 

cos 9 = cos 9 — e6 sin 9, 
sin 9 = sin 9 + eS cos 9. 



(46) 

(47) 



(48) 



Formula (46) is equivalent to the following result by the Wachter model: 

(/ A A A /)(/ A B Am) 

= l- m+ f A {B — A) AlAm + lAm 

+l ■ mf A {B — A) + f A {A ■ ml — B ■ Im). 



( 49 ) 
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The general form of (49) in nD geometry is [27]: let Ir^Is be r-blade, s-blade 
in Qn respectively, 0 < r, s < n, then 



if A A A Ir){f A B A Is) 

— ((/ ^ ^)(/ -®)) (-^(/a4)~ (-^’■)-^(/ab)~ 
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Abstract. An algebraic variety is a geometric figure defined by the zeros 
of a set of multivariate polynomials. This paper explains how to adapt 
two general zero decomposition methods for efficient decomposition of 
affine algebraic varieties into unmixed and irreducible components. Two 
devices based on Grobner bases are presented for computing the gen- 
erators of the saturated ideals of triangular sets. We also discuss a few 
techniques and variants which, when properly used, may speed up the 
decomposition. Experiments for a set of examples are reported with com- 
parison to show the performance and effectiveness of such techniques, 
variants and the whole decomposition methods. Several theoretical re- 
sults are stated along with the description of algorithms. The paper ends 
with a brief mention of some applications of variety decomposition. 



1 Introduction 

Let /C be a fixed ground field of characteristic 0, . . . , Xn] the ring of poly- 

nomials in the variables * 1 , . . . , (abbreviated x) with coefficients in /C, and /C 
some extension field of JC. For any non-empty polynomial set P C K\x], Zero(P) 
denotes the set of all common zeros in 1C of the polynomials in P. Any element of 
Zero(P) can then be considered as a point in an n-dimensional affine space AJg 
with coordinates x over A. Let QJ be a collection of points in A^; QJ is called 
an affine algebraic variety, or simply a variety, if there exists a polynomial set 
P C JC[x] such that QJ = Zero(P). We call P the defining set and P = 0 the 
defining equations of 03. Examples of algebraic varieties are algebraic curves and 
surfaces in three-dimensional affine space. 

A variety 233 is called a subvariety of another variety 23, which is denoted as 
233 C 23, if any point on 233 is also on 23. A variety 233 is called a true subvariety 
of 23 if 233 c 23 and 233 7 ^ 23. 

A variety 23 C A^ is said to be irreducible if it cannot be expressed as the 
union of two or more true subvarieties of 23. An irreducible subvariety of 23 is said 
to be irredundant if it is not contained in another irreducible subvariety of 23. An 
irredundant irreducible subvariety of 23 is also called an irreducible component 
of 23. The variety 23 is said to be unmixed if all its irreducible components have 
the same dimension. 

We want to decompose any affine algebraic variety defined by a given set of 
polynomials as the union of a family of irreducible subvarieties. One approach 
initiated by Wu [22] for this purpose consists of two major steps: first computing 
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an irreducible zero decomposition using the characteristic set method [12,22] and 
then determining the defining sets of the subvarieties from the irreducible as- 
cending sets (by means of which the zero decomposition is represented) . Wu [23] 
proposed a method based on Chow bases for determining the prime bases from 
the obtained irreducible ascending sets. The use of Grobner bases [2] was pro- 
posed in [4] and [14] independently as another method for computing such prime 
bases. The author has experimented the approach of irreducible variety decompo- 
sition based on the computation of characteristic sets and Grobner bases [15]; 16 
examples were given to illustrate the method and its practical performance. 

We remark that the technique of determining a prime basis from an irre- 
ducible ascending set using Grobner bases suggested in [4,14] works as well for 
determining a defining set G of the corresponding variety from any (non-trivial) 
ascending set A; this was shown in [7]. The set G is actually a finite basis for 
the saturated ideal Ideal(A) : J°°, where Ideal(A) denotes the ideal generated 
by the polynomials in A and J is the product of the initials of the polynomials 
in A. The same technique of computing a finite basis for an arbitrary saturated 
ideal was already given in [8]; see also [5,20]. Using this general technique and a 
theorem about the dimensionality of any ascending set in [7], an unmixed variety 
decomposition can be readily obtained from any proper zero decomposition. 

In [16,19,21] are presented two general methods for decomposing any polyno- 
mial system into triangular systems of different kinds, which make use of some 
early elimination ideas of A. Seidenberg and J. M. Thomas with top-down elim- 
ination, splitting and subresultant regular subchain as computational strategies. 
Other decomposition methods have been proposed in [11] and [9,10]: the for- 
mer emphasizes the canonicality of the computed triangular sets with various 
requirements while the latter aims at computing the variety decomposition rep- 
resented by regular and irreducible triangular sets. These methods may be used 
to replace Ritt-Wu’s method of characteristic sets for irreducible or unmixed 
decomposition of algebraic varieties. Note that replacing one method of zero de- 
composition by another, faster or slower, does not necessarily yield a superior 
or inferior method for the decomposition of varieties. For the triangular sets 
computed by different algorithms usually have different forms, that affect the 
subsequent computation of saturation bases. Therefore, investigating other zero 
decomposition methods for variety decomposition has practical significance. 

One purpose of this paper is to use our zero decomposition methods instead 
of Ritt-Wu’s for decomposing algebraic varieties into immixed and irreducible 
components. In Section 3, a different algorithm for constructing the finite bases 
for saturation ideals is recalled; this method proceeds by computing successively 
ideal quotients using Grobner bases. Meanwhile, we explain several variants and 
techniques which can enhance the practical efficiency in different cases. A number 
of properties for variety decomposition are presented and show, in particular, the 
significance of the concept and computation of regular sets. The formal proofs for 
most of the theoretical results are omitted and can be found from the indicated 
references. Many of them are contained in the author’s forthcoming book [21]. 
Experiments with our decomposition methods based on Seidenberg’s elimination 
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idea and subresultant regular subchains and the previous one using characteristic 
sets are reported in table form for 22 examples. Our methods are faster in terms 
of computing time for most of the complete cases. Some of the new techniques, 
which do not lead to any additional cost, can speed up the decomposition many 
times when they succeed. The paper ends with a short summary of applications 
of variety decomposition. 

2 Zero Decomposition 

Let the variables be ordered as xi ^ • • • -< Xn- For any Xi and P G IC[x], the 
degree and leading coefficient of P in Xi are denoted by deg(P, Xi) and lcoef(P, Xi) 
respectively. If P is not a constant (i.e., P ^ /C), we call the smallest index p such 
that P G . . . ,a;p] the class of P; Xp the leading variable of P, denoted by 
Ivar(P); lcoef(P, Xp) the initial of P, denoted by ini(P); and P — ini(P)xp'*®^’^’^*’^ 
the reductum of P, denoted by red(P). For any two polynomials P and Q with 
Q ^ IC, prem(P, Q) denotes the pseudo-remainder of P with respect to Q in 
Ivar(Q). 

An ordered set T = [Pi , . . . , P^] of r non-constant polynomials is called a 
triangular set if Ivar(Pi) ^ ^ Ivar(Pr). For any polynomial P, 

prem(P, T) = prem(- • • prem(P, P^), . . . , Pi) 

is called the pseudo-remainder of P with respect to T; the symbol = reads as 
“defined to be.” For any polynomial set P, prem(P, P^) stands for {prem(P, P^) | 
P G P}, prem(P, T) for {prem(P, T) | P G P}, Ivar(P) for {Ivar(P) | P G P}, 
and ini(P) for {ini(P) | P G P}. 

Let [P, Q] be a pair of polynomial sets, called a polynomial system, with which 
the zero set Zero(P/Q) is of concern, where 

Zero(P/Q) A Zero(P) \ [J Zero(Q). 

QeQ 

We write Zero(P) for Zero(P/0) and Zero(P/Q) for Zero(P/{Q}), etc. 

A triangular system is a pair [T, U] in which T is a triangular set and U a 
polynomial set such that ini(P) does not vanish on Zero(T/U) for any P G T. A 
triangular system [T, Ul is said to be fine if 0 ^ prem(U, T). A triangular set T 
is fine if [T,ini(T)] is fine. 

By zero decomposition we mean to compute for any given non-empty polyno- 
mial set P C /C[a;] a finite set P which is either empty (in this case Zero(P) = 0 
in any extension field of /C) or of the form P={[Ti,Ui],..., [Te,Ue]} such that 

e 

Zero(P) = [J Zero(T^/U^), (1) 

where each [Ti,lUi] is a fine triangular system. In other words, either the empti- 
ness of Zero(P) is detected or the polynomial set P is decomposed into finitely 
many fine triangular systems [Ti,Ui]. We have been studying three different yet 
related algorithms reviewed below for such zero decomposition. 
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Ritt-Wu’s Algorithm 

This is a well-known algorithm attributed to Ritt [12] and Wu [22] which starts by 
computing a characteristic set C of the given polynomial set P. In the trivial case, 
C is contradictory (i.e., C consists of a single constant) and thus Zero(P) = 0. 
Otherwise, the computed C is a {weak) ascending set (that is a triangular set 
with some restriction on the degrees of polynomials) such that 

Zero(P) C Zero(C), Zero(C/I) C Zero(P), 

where I = ini(C). In fact, prem(P, C) = {0} holds as well. It follows from the 
above zero relations that 

Zero(P) = Zero(C/I) U Zero(PU {/}). 

/GI 

One can proceed further by computing a characteristic set of each PU {/} in the 
same manner. This process will terminate so that a zero decomposition of the 
form (1) is reached, with = Q and Ui = ini(Ti). In addition, the property 
prem(P, Ci) = {0} holds for all i. See [3,12,17,22] for details. 



Seidenberg’s Algorithm Refined 

Now let us explain the algorithm described in [16] that computes the zero de- 
composition (1) by eliminating the variables successively from Xn to xi with 
recursive pseudo-division and by splitting the system whenever pseudo-division 
is performed. Concretely, for each Xk, k = n, . . . ,1, one proceeds as follows. 

The iteration starts with k = n. If P does not contain any polynomial of 
class k then go for fc — 1. Otherwise, let T € P be of class k and have minimal 
degree in Xk- Then 



P = 0 



P* = 0, / = 0, red(r) = 0, or 
prem(P, T) = 0, T = 0, 1^0, 



where 

P*=P\{T}, J = ini(T). 

Therefore, we have 



Zero(P) = Zero(P* U {/,red(T)}) U Zero(prem(P, T) U {T}/I). 

Note that for both P* U {/, red(T)} and prem(P, T) U {T} either the numbers of 
polynomials of class k decrease, or so do the minimal degrees of such polynomials 
in Xk (in comparison with P). The process continues for each split system [P', Q'] 
by taking from P' another polynomial of class k and having minimal degree in Xk 
(if there exists any) and splitting further. With every [P',Q'] one proceeds for 
k — 1 when P' contains none or only one polynomial T' of class fc; in the latter 
case, Q' is meanwhile replaced by prem(Q',T'). The system [P',Q'] is discarded 
whenever P' n /C \ {0} 0 or 0 € Q'. 

Clearly, the above procedure will terminate with a zero decomposition of the 
form (1). 
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Subresultant-Based Algorithm 

The third algorithm makes use of the key concept of subresultant regular sub- 
chains (s.r.s.) and their computation, for which the details can be found in [19]. 
The algorithm also employs a top-down elimination for Xk, k = n, . . . ,1. 

Let us begin with k = n. If, trivially, P does not contain any polynomial of 
class fc, then proceed next for k — 1. Consider the simple case in which P contains 
only one polynomial P of class fc; let / = ini(P). Then 



P = 0 



P = 0, / 0; or 

P\{P} = 0,/ = 0,red(P) = 0. 



Here two subsystems are produced. For the first, we have obtained a single 
polynomial P in Xk whose initial is assumed to be non-zero, so the process can 
continue for next k. For the second, the minimal degree in Xk of the polynomials 
of class k has decreased. So we can assume that the subsystem may be dealt 
with by induction. 

Now come to the more general case in which P contains two or more polyno- 
mials of class k. Let Pi, P 2 G P be two of them with P 2 having minimal degree 
in Xk and compute the s.r.s. H 2 , ■ ■ . ,Hr of Pi and P 2 with respect to Xk- Let 
I = lcoef(P 2 ,Xfc) and li = lcoef{Hi,Xk) for 2 < i < r. Then, by the properties 
of s.r.s. we have 



P= 0 



where 



It follows that 



P 2 = 0,7 = 0 ,red(P 2 ) = 0, or 



P 12 — 0, — 0, 

7i_i_i — 0, . . . , /t- — 0, 



1 ^ 0 , 



for some 2 < i < r, 



P2=P\{P2}, Pl2=P\{Pl,P2}. 



Zero(P) = Zero(P 2 U {/,red(P 2 )}) U (J Zero(Pi 2 U 

i=2 



To see the significance of the splitting, one may note that the number of the 
polynomials of class k in P 12 U {Hi, li+i, . . . ,Ir} is at least one less than that of 
the polynomials of class k in P. The process may be continued by iterating the 
above step for each split system [P' , Q'j . The remarks at the end of the preceding 
subsection should also be added here. We may thus devise another algorithm to 
compute zero decompositions of the form (1). Such an algorithm is described 
formally in [21, Sect. 2.3]. 

A triangular system [T, U] or a triangular set T is said to be quasi-irreducible 
if all the polynomials in T are irreducible over 1C. Making use of polynomial 
factorization over the ground field /C, one can compute zero decompositions of 
the form (1) with each triangular system [Ti,Ui] quasi-irreducible. 
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Computing Regular and Irreducible Triangular Systems 

Let T = [Ti, . . . , Tr] be a fine triangular set with Xp^ = Ivar(Tj) and Ij = ini(Tj) 
for 1 < j < r. T is said to be regular or called a regular set if 

res( ■ ■ • res(res(/j-|_i , Tj , Xp ^ ) , Tj —\ , Xp^_-^ T \ , Xp ^ ) ^ 0 

for 2 < j < r, where res(F, G, x) denotes the resultant of F and G with respect 
to x. 

A fine triangular system [T, U] is called a regular system if T is regular, 
Ivar(T) n Ivar(U) = 0, and ini(?7) does not vanish on Zero(T/U) for any U gV. 

An algorithm is presented in [21, Sect. 5.1] that computes zero decomposi- 
tions of the form (1) with each [T^, Ui] a regular system. The main new ingredient 
of this algorithm is to use the only polynomial P 2 of class k in P', when obtained, 
to eliminate all the polynomials of class k from Q', where [P^Q'] is the poly- 
nomial system under consideration and ini(P 2 ) does not vanish on Zero(PYQ'). 
The elimination is performed with further splitting roughly as follows. 

Assume that Q' contains a polynomial, say Pi, of class k. Compute the 
s.r.s. H 2 , . . . ,Hr of Pi and P 2 with respect to Xk, and let li = lcoef{Hi,Xk) for 

2 < i < r. Then, it follows from the properties of s.r.s. that 

r 

Zero(PVQ') = |J Zero(P' \ {P 2 } U {T„/.+i, . . ■ , Y}/Q' U {/J), 

i=2 

where Li is the pseudo-quotient of P 2 by Hi with respect to Xk- Observe that, 
when Hi is of class k, Li has smaller degree than P 2 in Xk- It is always the case 
for 2 < i < r — 1. If Hr has class smaller than k, then 

Zero(P' \ {P 2 } U {Lr}/Q' U {/,}) = Zero(P'/Q' \ {Pi} U {/,}), 

i.e.. Pi may be eliminated from Q'. The above process can be repeated for 
each split system so that all the polynomials of class k in the second set of the 
system are eliminated. In this way, we shall obtain from [P', Q'j finitely many new 
systems [P', Q'j such that each P' contains either none or exactly one polynomial 
of class k, while in the latter case Q' does not contain any polynomial of class k. 

A fine triangular set T as above is said to be irreducible if each Tj, as a 
polynomial of , is irreducible over K.{xi , . . . , Xp^._i), where Xp ^ , • ■ • , are 
adjoined as algebraic elements by the equations Pi = 0, ... , Tj_i = 0 in the case 

3 > 1 - 

A fine triangular system [T, U] is said to be irreducible if T is irreducible. 

If T is reducible, one can find a k such that 

= [T,,...,Tk-i] 

is irreducible and the polynomial Tk has an irreducible factorization of the form 



PTfc = Pi • • • Ft 



186 Dongming Wang 



over the extension field . . . , with the algebraic 

elements Xp^ , . . Xp^_^^ adjoined by Ti = 0, . . . , Tk-i = 0, where 

D e JC[xi, . . . Fj e JC[xi,. . .,Xp^] 

all have degree smaller than deg{Ti,Xp^) for 1 < z < fc — 1 and the dot equality 
means that prem(HTfc — Fi • • • = 0 (see [21, Sect. 7.5]). 

Therefore, for each triangular system [Ti,Ui] in (1) with reducible, say 
having the same form as T above, one may get a further zero decomposition of 
the form 

t 

Zero(T,/U,) = Zero({T>} U T,/U,) U |J Zero(T^/U, U {D}), (2) 

i=i 

where each T^- is an ordered polynomial set obtained from by replacing Tk 
with Fj. 

Let Uij = prem(Ui U {D}, T^). If contains 0 for some j, then the corre- 
sponding component in (2) can be simply removed. It is easy to see that for those 
components in which does not contain 0, [Ty, Uy] is still a fine triangular 
system and its irreducibility can be further verified. 

The polynomial set {D} U may no longer be in triangular form, but it 
can be further triangularized by applying the same algorithm as for P. In this 
way we shall finally arrive at a zero decomposition of the form (1) with each 
triangular set irreducible. 

In case Ritt-Wu’s algorithm is not used, the computed Ui may contain many 
more polynomials than actually needed. The redundant polynomials can be 
deleted by using the following theorem [16,21], which is of both theoretical in- 
terest and practical importance. 

Theorem 1. Let P be any polynomial set in IC[x] having a zero decomposition 
of the form (1) with each [T^, Ui] a regular or irreducible triangular system. Then 

e 

Zero{F) = Zero{Ti/ini{Ti)). (3) 

i=l 

Let P have a zero decomposition of the form (3) with each a fine triangular 
set and F = {Ti,...,Te}. We set F = % when Zero(P) = 0 is verified. F 
is called a W- characteristic series of P if prem(P, T^) = {0} for each i. F is 
called a regular series of P if all are regular. In general, a regular series 
is not necessarily a W-characteristic series of P (i.e., there is no guarantee for 
prem(P, Ti) = {0}). However, it is proved in [21] that there exists an integer q 
such that prem(P'^, T^) = 0 for all P G P when is regular. 

3 Decomposition of Varieties 

In the decomposition (3), Zero(Ti/ini(Ti)) is not necessarily an algebraic vari- 
ety. We need to construct from each a finite set P^ of polynomials, called a 
saturation basis of T^, that defines a subvariety of Zero(P). 
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Ideal Saturation for Triangular Sets 

Let 3 be an ideal and F a polynomial in /C[a:]. The saturation of J with respect 
to F is the ideal 

3 : = {P G lC[x] I F'^P G 3 for some integer q > 0}. 

For any non-empty polynomial set P C /C[a;], Ideal(P) denotes the ideal generated 
by (the polynomials in) P. A finite basis for 3 : F°° can be computed by using 
Grobner bases [2] according to the following lemma. 

Lemma 1. Let 3 be an ideal generated by P and F a polynomial in 
Fi , . . . , Ft be t factors of F such that Fi • • • Ft yf 0 <i=^ F yf 0; 

P* = PU{zF-l}, P* = PU{ziFi-l I l<i<t}, 

where z, Z\, . . . , Zt are new indeterminates; and G*,G* be the Grobner bases of 
P* in IC[x, z] and o/P* in IC[x, z\, , Zt] with respect to the purely lexicographical 
term ordering (plex) determined with xi < z and xi -< Zj, respectively. Then 

3: F°° = Ideal{^*) ^JC[x] = Ideal{G* n K.[x]) 

= Ideal{F*) nlC[x] = Ideal{G* nlC[x]). 

Proof See [8,7,20], 

In fact, for the Grobner bases computation any compatible elimination order- 
ing in which x^f ■ ■ ■ x]^ A z (or Zj) does. As we have mentioned in Section 1, the 
above technique of computing saturation bases was introduced independently 
by several researchers. The result stated in Lemma 1 appeared first in [8]. The 
special case of the lemma, in which P is an irreducible ascending set A and F is 
the product of the initials of the polynomials in A, was proved in [4,14]. A proof 
for a slightly different version of the lemma may be found in [7]. The lemma 
in the above form is stated and proved in [20,21]. What is explained below is 
another method for determining a finite basis for any 3 : F°°; this method has 
been presented in [.5]. 

Let 3 and 5 be two ideals in K\x]. The infinite set of polynomials 

a : a = {F G K.[x] 1 FG G a for all G G 3} 

is called the ideal quotient of 3 by fj. It is easy to show that in K,[x\ the quotient 
of two ideals is an ideal (see, e.g., [5, p. 193]). For any polynomial F, we write 
3 : F instead of 3 : Ideal({F}). 

Lemma 2. Let 3 be an ideal and F a polynomial in IC[x], and let k be an 
integer > 1. Then 

3 : F°^ = 3 : F^ 3 : F’^ = 3 : F^+\ 

As a consequence, the minimal k can be determined by computing 3 : F* with i 
increasing from 1 . 
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Proof. Exercise in [5, p. 196]. 

This lemma provides an algorithm for determining a basis for 3 : F°° by 
computing the bases for the ideal quotients 3 : with k increasing from 1 . The 

basis is obtained when 3 : F^ = 3 : F^~^^ is verified for some k; in this case 
3\ F>^ = 3: 

A basis for the ideal quotient 3 : G can be computed by using Grobner bases 
in the following way. Let 3 be generated by Fi, . . . ,Fs- Compute the Grobner 
basis G of 

{zFu...,zF,,(l-z)G} 

in JC[x, z] with respect to an elimination term ordering with Xi ^ z and let 

U = {H/G\ HGGnlC[x]}, 

where z is a new indeterminate. Then 3 : G = Ideal(H) and thus H is a finite 
basis for 3 : G. See [5,8] for the correctness of this method. 

Let T be any triangular set in /C[a;]. The saturation of T is the ideal 

sat(T) A Ideal(T) : 



where J = rireT ini('T). 

Let P be a finite basis for sat(T); the following relation is obvious 
Ideal(T) c sat(T) = Ideal(P). 

Theorem 2. Let T C /C[a;] be any triangular set. Then T is regular if and only 
*/ 

saf(T) = {P g K.[x] \ prem{P,T) = 0}. 

Proof. See [1] and [21, Sect. 6.2]. 

For any irreducible triangular set T, Theorem 5 below asserts that sat(T) is a 
prime ideal. For any F G K\x], if prem(F, T) ^ 0, then F ^ sat(T) according to 
Theorem 2 and thus sat(T) : F°° = sat(T) by definition. This result is generalized 
in the following lemma for regular sets. 

Lemma 3. Let T he a regular set and F any polynomial in IC\x]. LfreslF, T) 0, 
then sat{T):F- = satin 

Proof. See [21, Sect. 6.2]. 

Proposition 1. Let [T,U] be a regular system in IC[x] and V = Jlc/gu ^ ■ Then 

IdealiT) : = saf(T). (4) 

Proof. See [21, Sect. 6.2]. 

As a consequence of (4), we have Zero(Ideal(T) : = Zero(sat(T)). 
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Unmixed Decomposition 

Refer to the zero decomposition (3) which provides a representation of the va- 
riety QJ defined by P in terms of its subvarieties determined by T^. As we have 
mentioned, each Zero(Ti/ini(Ti)) is not necessarily an algebraic variety; it is 
a quasi- algebraic variety. From the results above, we have seen that a corre- 
sponding set of polynomials may be obtained by determining, from each T^, a 
saturation basis with Grobner bases computation. 

Theorem 3. Let ^ be a non-empty polynomial set in IC[x] and {Ti, . . . ,Te} a 
W- characteristic series or a regular series of P. Then 

e 

ZerofF) = [J Zero{sat{Ti)). (5) 

i=l 



Proof. See [7] and [21, Sect. 6.2]. 

Let [S'] denote the number of elements in the set S. The following result used 
by Chou and Gao [3] provides a useful criterion for removing some redundant 
subvarieties in the decomposition (5) without computing their defining sets. 

Lemma 4. Let P and be as in Theorem 3. Lf |Tj| > jPj, then 
Zero{sat{Tj)) C Zero{sat(Ti)); 

1 < z < e 

i / j 

thus Zero{sat{Tj)) can be deleted from (5). 

Proof. As |Tj| > jPj, the dimension of T^- is smaller than n — jPj. By the affine 
dimension theorem (see [3] or [21, Sect. 6.1]) and Theorem 4, Zero(sat(Tj)) is a 
redundant component of Zero(P). 

A triangular set T is said to be perfect if Zero(T/ini(T)) ^ 0 in some suitable 
extension field of K.. The following result is due to Gao and Chou [7]. 

Theorem 4. Let T be any triangular set in /C[a;] . //T is not perfect then sat(T) = 
/C[a;]; i/T is perfect then Zero{sat(T)) is an unmixed variety of dimension n—\T\. 

Recall that any regular or irreducible triangular set T is perfect, so the variety 
Zero(sat(T)) is unmixed of dimension n — |T|. 

In (5), for each i let P^ be a finite basis for sat(Ti) which can be determined by 
computing Grobner bases according to Lemma 1 or by computing Ideal(Ti) : 
with k increasing from 1. If sat(Ti) = IC[x], then the constant I is contained in 
(the Grobner basis of) P^. Let us assume that such P^ is simply removed. Thus, 
a variety decomposition of the following form is obtained: 

e 

Zero(P) = [J Zero(Pj). 



( 6 ) 
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Apparently, Zero(P) = 0 when e = 0 (i.e., all such have been removed). By 
Theorem 4, each P^ defines an unmixed algebraic variety. 

Let QJi = Zero(Pi); then the decomposition (6) can be rewritten as 

= 2Ji U • • • U 9Je- 

This decomposition may be contractible; that is, some variety may be a sub- 
variety of another. Some of the redundant subvarieties may be easily removed 
by using Lemma 4. The following lemma points out how to remove some other 
redundant components. 

Lemma 5. Let G be a Grohner basis and P an arbitrary polynomial set in 
If every polynomial in P has normal form 0 modulo G, then Zero(G) C Zero(P). 

Proof. Since every polynomial in P has normal form 0 modulo G, Ideal(P) C 
Ideal(G). It follows that Zero(G) C Zero(P). 

The method for decomposing an algebraic variety into unmixed components 
explained above can be described in the following algorithmic form. 

Algorithm UnmVarDec (Input: P; Output: <F). Given a non-empty polynomial 
set P G K\x], this algorithm computes a finite set W of polynomial sets Pi, . . . , Pe 
such that the decomposition (6) holds and each P^ defines an unmixed algebraic 
variety. 

Ul. Compute a W-characteristic series or a regular series ^ of P and set •f'^0. 
U2. While yf 0 do: 

U2.1. Let T be an element of and set ^ \ {T}. If |T| > |P| then go to 
U2. 

U2.2. Compute a finite basis for sat(T) according to Lemma 1 or 2, let it 
be given as a Grobner basis G and set !?'<—!?' U {G}. 

U3. While 3G, G* € 'L such that the normal form of G modulo G* is 0 for all 
G G G do: 

Set W^L'\{G*}. 

The termination of the algorithm is obvious. The variety decomposition (6) 
and the unmixture of each Zero(Pi) is guaranteed by Lemma 1 and Theorem 4. 

Note that the unmixed decomposition (6) computed by Algorithm Unm- 
VarDec is not necessarily irredundant. To remove all redundant components, 
extra computation is required. The removal of redundant subvarieties by exam- 
ining the containment relations among the corresponding Grobner bases has the 
drawback that one component can be removed only if the corresponding Grobner 
basis has already been computed. The following lemma provides another crite- 
rion for removing redundant components. 

Lemma 6. Let T be a regular set in /C[a:] and P a finite basis for sat(T). IfP* 
is a polynomial set such that prem(P* ,T) = {0}, then Zero(P) C ZerofP*). 

Proof. Since T is regular and prem(P*,T) = {0}, P* C sat(T) by Theorem 2. It 
follows that 



Zero(P) = Zero(sat(T)) C Zero(P*). 
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Irreducible Decomposition 

Now we come to decompose an arbitrary algebraic variety defined by a polyno- 
mial set into a family of irreducible subvarieties. This is done with an analogy 
to the immixed decomposition of Zero(P), requiring in addition that each trian- 
gular set Ti in the zero decomposition is irreducible. Then any finite basis for 
sat(Ti) will define an irreducible variety with any generic zero of as its generic 
point. 

Note that any point ^ of an algebraic variety QJ over some extension of /C, 
which is such that every polynomial annulled by ^ vanishes on QJ, is called a 
generic point of QJ. A generic zero of an irreducible triangular set T is a zero 
(ail, . . . ,ain) of T in which Xi takes an indeterminate when Xi ^ Ivar(T). An ideal 
'3 <Z K\x\ is said to be prime if whenever F,G G K,[x] and FG G 3, either F G 3 
or Gg3. 

Theorem 5. For any irreducible triangular set T C /C[a;], the ideal sai(T) is 
prime. 

Proof. Let ^ be a generic zero of T; then 

prem(P, T) = 0 P{^) = 0 

for any P G K.[x] by Lemma 3 in [22]. Let FG G sat(T). Then prem(FG,T) = 0 
according to Theorem 2, so 

F(|)G(|) = 0. 

It follows that either F(^) = 0 or G(|) = 0; that is, either prem(F, T) = 0 
or prem(G,T) = 0. By Theorem 2, we have either F G sat(T) or G € sat(T). 
Therefore, sat(T) is prime. 

When sat(T) is prime, its finite basis is called a prime basis of T and denoted 
by PB(T). Then the variety defined by PB(T) should have any generic zero of T 
as its generic point. 

Proposition 2. Let Ti and T 2 be two irreducible triangular sets in JC[x] which 
have the same set of generic zeros. Then sat(Ti) = sai(T 2 ). 

Proof. See [21, Sect. 6.2]. 

Proposition 3. Let Ti and T 2 be two triangular sets in /C[a;] with lvar(Ti) = 
lvar(T 2 ), andT 2 be irreducible. //prem(T 2 , Ti) = {0}, thenTi is also irreducible 
and has the same set of generic zeros as T 2 ; thus sai(Ti) = sat(T 2 ). 

Proof. See [21, Sect. 6.2]. 

Proposition 3 generalizes a result in [3]; in the same paper the following is 
also proved. 

Proposition 4. Let Ti and T 2 be two triangular sets in /C[a;], of which Ti is 
irreducible. // prem(T 2 , Ti) = {0} and 0 ^ prem{ini(T 2 ),Ti), then sat(T 2 ) C 
sat(Ti). 
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Let each in (5) be irreducible. Then we have the following zero decompo- 
sition 

e 

Zero(P) = |JZero(PB(T,)). 

i=l 

Now each PB(Ti), which can be exactly determined by using Grobner bases, 
defines an irreducible algebraic variety and we have thus accomplished an irre- 
ducible decomposition of the variety QJ defined by P. 

This decomposition is not necessarily minimal. All the redundant subvarieties 
can be removed by using Proposition 4 and Lemma 6 or 5, so one can get a 
minimal irreducible decomposition. 

The following algorithm is modified from UnmVarDec and terminates obvi- 
ously. Its correctness follows from the above discussions and the remarks on early 
removal of redundant components in Section 4 (that ensures the decomposition 
minimal) . 

Algorithm IrrVarDec (Input: P; Output: W). Given a non-empty polynomial set 
P C K\x], this algorithm computes a finite set W of polynomial sets Pi, . . . ,Pe 
such that the decomposition ( 6 ) holds, it is minimal, and each P^ defines an 
irreducible algebraic variety. 

11. Gompute an irreducible W-characteristic series of P and set <?^{T S 
^1 |T| < |P|}, if^0. 

12. While ^ yf 0 do: 

12.1. Let T be an element of ^ of highest dimension and set ^ \ {T}. 

12.2. Gompute a finite basis G for sat(T) according to Lemma 1 or 2 and set 

U {G}. 

12.3. While 3T* S 'P such that prem(G,T*) = {0} do: 

Set 

4 Examples, Experiments and Techniques 

Here examples and experiments are presented mainly to illustrate irreducible 
decomposition which usually is more difficult. 

Three Illustrative Examples 

Example 1. Gonsider the polynomial set P = {Pi, P 2 , P’s}, where 

Pi = —X2X^X/i — X4 — X1X3 + X2 — X2 — Xi, 

P2 = X2x1 — 2 x\x\ + x\+ X\X2X'i — 2X2X3 + Xi, 

P 3 = 2 a; 2 a; 3 a ;4 -I- 2x4 + X 2 x\ — 2x\x\ + x\ + xi 0 : 2 X 3 — 2 x 22:3 -I- 2 x 1 X 3 -I- 2 x 2 + 2:1 

are all irreducible over Q (the field of rational numbers). P can be decomposed 
into three fine quasi-irreducible triangular systems [T, |x 2 ,/}], [T',|xi}], [T",0] 
such that 



Zero(P) = Zero(T/x 2 /) U Zero(T'/xi) U Zero(T"), 
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where 





1 

H 

1 

(M (M 

H 

1 


T = 


2:2X3 — 2x\xl + xl + X1X2X3 — 2x2X3 + Xl, 




1x4 + X1X2X3 + Xl 


T' = 


xl~Xi,I], T" = [xi, . . . ,X4] 



and I = X\X^ + X2- 

Let the three polynomials in the above T be denoted Ti , T2 , respectively. 

T is not regular because res(res(i, T2, X3), Ti, 12) = 0. In fact, T is reducible: one 
may verify that T2 has an irreducible factorization 

T 2 = —I{x3 - X2^ 

Xl 

over Q(xi,X2) with X2 having minimal polynomial T\. Therefore, 

Zero(T/a;2/) = Zero([Ti,a;3 — X2,T3]/x2l) = Zero(T* /X2I), 

where 

T* = [Ti,a;3 - X2,X4 + X 2 ]. 

It is easy to see that both T' and T" are irreducible. So we have, after applying 
Theorem 1 and removing a redundant component, an irreducible zero decompo- 
sition of the form 

Zero(P) = Zero(T*) U Zero(T7a;i). 

The three polynomials in T* all have constant initials, so T* defines by itself 
an irreducible algebraic variety. To determine a prime basis of T', we compute 
the Grobner basis G of T' U {zx\ — 1} over Q under plex with xi ^ ^ X4 ^ z. 

It is found that 

G = T' U {X2X3 + l,z - xl}, 
so the prime basis wanted is 

P' = T' U {X2X3 + I}. 

Hence we obtain finally the following irreducible decomposition of the variety 
defined by P 

Zero(P) = Zero(T*) U Zero(P7 

Finally we remark that the triangular set T' is irreducible, even though the 
second polynomial I in T' may be written as / = X2{x2X3 + 1) over Q(a;i,a;2) 
with X2 having minimal polynomial xl — x\. Let T = [xl — xi,X2X3 + V\. It follows 
from the factorized from of / and Theorem 1 that 

Zero(T'/a;i) = Zero(T/a;2) = Zero(T); 

the last equality holds because a;2a;3 -1-1 = 0 implies that X2 7 0- Therefore, P' 
can be replaced by T (in fact, P' is a plex Grobner basis of T). Simplifications of 
this kind to avoid the computation of saturation bases can be achieved by some 
other algorithms. 
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Example 2. Let the algebraic variety 03 be defined by P = {Pi, P 2 , P 3 }, where 
Pi = — X 2 + 2x1 — 2, 

P2 = 3xfx4 + 4x2X3 + 6X1X3 — 2x2 ~ 3 xiX 2 , 

P 3 = 3 x§X4 + X1X4 — X2X3 — X2- 

With Xi ^ ^ X4, P may be decomposed into 2 irreducible triangular sets Ti 

and T2 such that 



Zero(P) = Zero(Ti//) U Zero(T2/x2), 

where 

Ti = [Ti,T2,/X4 — 2X2 ~ 3 X 1 X 2 ], 

T2 = [xi, 2x3 - X2, 8x2X4 - 2X2 - 4]; 

Ti = 2x2 — 12xfx| + 9xix| — 9x^x| + 8xix| — 8x| + 24xfx2 — 24xfx2 + 18xf — 18x^, 
T 2 = 1x3 - xl] 

I = 2x2 + 3xf . 

To obtain an irreducible decomposition of 03, we determine the prime bases of 
Ti and T2 by computing the respective Grobner bases Gi and G2 of 

TiU{z/— 1} and T2 U {x2^ — 1} 



according to Lemma 1. The Grobner bases may be found to consist of 8 and 4 
polynomials respectively. Let Vi = Gi n /C[xi, . . . , X4] and 03i = Zero(Vi) for 
i = 1, 2. We have 



Vi 



Ti, 

27 xfx 3 — 27 xiX 3 + 2x| — 15 xix| + 9xiX2 + 8x1X2 — 8x2 + 12xi — 12xf, 
T2, 

12 xix| — 12 x§ — 9X1X3 — 2x1X2 + 8x2 + 4 xi — 4 xi, 

X1X4 — 2x1X3 + 2X3 — X2, 

X2X4 + 8X1X3 — 8x1X3 — x|, 

Pi 



and V2 = T2 such that 03 = 03i U OJ2. One can check with ease that this 
decomposition is minimal. 



Example 3. As a more complicated example, consider the algebraic variety de- 
fined by the following five polynomials 



Pi = X4X2 -I- Xi -I- X2X6 -I- 8x5, 

P2 = 54x4x5 -|- 9X4X2X6 — 9xixe — 9x2X7 — 18x3X2 — 2x|, 

P3 = 18x3X5 — 9x1x5-1-8x3X2X6-1-8x4X6X1-1-8x4x7X2 — 8x1X7 — 8x3X1—2x2X1, 
P 4 = 8X3X1X6 -I- 8X3X2X7 -I- 8X4X1X7 — 18x4x3X5 — 2x2X1, 

P5 = 9X3X1X7 — 27 x|x 5 — 2xf. 
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Let P = {Pi, . . . , P5} and the variable ordering be wi : xi xt. Un- 

der uji, P can be decomposed into 9 irreducible triangular sets Ti, . . . ,Tg such 
that 

9 

Zero(P) = Zero(Ti/ini(Ti)). 

i=l 

For z = 6, . . . , 9, the triangular set contains more than 5 polynomials and thus 
need not be considered for the variety decomposition by Lemma 4. Let Yi be 
the prime basis of under the ordering uji for z = 3,4, 5. Obviously T3 already 
defines an irreducible variety, so V3 = T3. It remains to determine the prime 
bases from Ti,T2,T4 and T5 according to Lemma 1 or 2. One may find that 
V4 = T4 and V5 is the same as the set obtained by replacing the last polynomial 
in T5 with 

9x7 + 9 x 4 X 5 — 2x2- 

A prime basis of Ti under uji contains 20 polynomials. To reduce the number of 
elements, we compute a Grobner basis of this prime basis with respect to another 
variable ordering W2 : X4 A X2 A xe A X3 A Xi A X7 A X5. The new basis Vi 
consists of 10 polynomials as follows 



Vi 



81 x| -I- 72 x 2 x| -I- 16x2X3 -I- 90 x|x|x 3 -I- 4x4X3 -I- 18 x|x 2 , 

6x4x|xi -I- 9x4X1 — 9 x 2 x| — 4x3X3 -I- 9X4X2X3 -I- 2x4X3 -I- 9x4X2, 
9x3X1 -I- 4 x|xi -I- 9x4X1 -I- 18x4X2X3 -I- 2x4X3 + 9x4X2, 

xf -I- X4X2X1 — X2X3, 

9X4X7 — 6X4X2X6X1 — 12 x|x 2 Xi -I- 9 x 6 x| -I- 18 x 4 x| -I- 4X2X6X3 

— 9X4X6X3 -I- 8X4x|x3 — 2X4X2X6 — 2X4X3, 

9X2X7 -I- 9X5X1 -I- 18X4X1 -I- 18X2X3 -I- 9X4X2X5 -I- 2X3 -I- 18X4X2, 
9X3X7 -I- 9x|x7 — 4x2X6Xi — 8X4X2X1 -I- 18X3 — 9X4X6X3 -I- 2X3X3 

— 2x4x|x6 — 2x|x3, 

9xiX 7 — 6x|xi — 18X4X1 -I- 9 x2XsX 3 — 18x4X2X3 — 4X4X3 — 18X4X2, 
8IX7 -I- 81x4X5X7 — 162x4X7 -I- 108x2X5X1 -I- 216x4X2X1 — 324 x§ 

— 81 XgX 3 -I- 162X4X5X3 — 72X3X3 -I- 54X4X3X5 — 4X3 -I- 36X4X3, 

Pi 



As for T2, the difficult case, let Ti denote the zth polynomial of T2 and It the 
initial of Ti for 1 < z < 5. The non-constant initials are 



I2, h, and I4 = I5 = X2- 

Thus, it is necessary to determine a prime basis from Tg by computing a Grobner 
basis of the enlarged polynomial set, for instance, T2U{2i/4 — 1, Z2/3 — 1, 23/2 — 1} 
or T2U{2;/2/3/4 — 1}. Nevertheless, the Grobner basis cannot be easily computed 
in either case. We have tried some of the most powerful Grobner bases packages 
without success. For this reason, we normalize Tg to get another triangular 
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set T 2 (see [21, Sect. 5.2] for the details of normalization): it is obtained from T 2 
by replacing T 2 and T 3 respectively with 

= —Ax\x 2 Xi + 81 a ;3 + 9 x 2 X^ — 9xfx'^ + 6 X 1 X 2 X 3 — 2 xf, 

T* = 972a;Ca;5 + 729(2a;^ + 27xf)xlxl + 81(2a;® + 9xfx^ - 81xf)xl 

— 648a:i(a;f + 9x^)x2a;3 + 9a;f ( 8 x 2 + ISOxfx^ + 81xf)x^ 

— 36xf (2^2 + 27xf)x2X3 + 2xf(4x2 + 90a;fa;| + 243a;f). 

T 2 and T 2 have the same set of generic zeros, so the prime bases constructed from 
them define the same irreducible algebraic variety. T 2 possesses the property that 
the initials of its polynomials only involve the parameters xi and X 2 - 

A prime basis of T 2 can be easily determined by computing the correspond- 
ing Grobner basis with respect to the variable ordering uji or 0 J 2 according to 
Lemma 1. The basis under tt >2 contains 9 elements and is as follows 

’ 81a;4a;g + lQx\x^ + 108a:4a;2a;6 + 324a;4a;6 + 20a;4a;2 -I- Iddslil -I- 324x4, ' 
144 x|x3 -I- 729X4X3 -|- 8IX4X6 -I- 16x2 -I- 144x|x| -I- 405x|, 

4xgX3 -I- 5 X 4 X 3 -I- X 4 X 6 -I- X 4 , 

4x2Xi -I- 27 x 4 X 3 -I- 2 x 4 X 2 -I- 9 x 4 , 

^ 18x6Xi -I- 36x4X1 — 18x2X3 -I- 9X4X2X6 — 2x2, 

^ 972x4x3X1 -I- 324x|xi — 1296 x2x| — 405x|x2X3 -I- 81x4X2X6 -I- 16x2 

-I- 108x4X2 -I- 243x|x2, 

144xf -I- 1296x| — 8IX4X3 — 8IX4X6 — 16x2 — 144x4x| — 405x1, 

6x7 -I- 18x3 -I- 3 x 4X6 -I- 2x| -I- 12x4, 

Pi 

It is easy to verify that both Zero(V 4 ) and Zero(V 5 ) are subvarieties of Zero(Vi). 
Therefore, the variety defined by P is decomposed into three irreducible subva- 
rieties defined by Vi,V 2 and V 3 . Symbolically, 

Zero(P) = Zero(Vi) U Zero(V 2 ) U Zero(V 3 ), 

where Zero(Vi) is irreducible for i = 1,2,3. 



Experimental Results 

We first provide timing statistics for irreducible variety decomposition on 22 
examples using the three zero decomposition algorithms reviewed in Section 2. 
The first 16 examples are given in [15]. Here are added to our test set the last 
six examples, for which the defining polynomials of the varieties as well as the 
variable orderings used for the computation are listed in the appendix. 

The heading entries Ritt-Wu, Seidenberg refined and Subresultant-based in 
Table 1 indicate the respective zero decomposition algorithms used for the ir- 
reducible variety decomposition. The experiments were made with Maple V.5 
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running on an i686 under Linux 2.0.36 at LIP6, Universite Paris VI. The tim- 
ings are given in CPU seconds and include the time for garbage collection. The 
symbol >2000 indicates that the computation was interrupted manually after 
about 2000 CPU seconds, and the cases marked with *** were rejected by Maple 
for “object too large.” 



Ex 


Ritt-Wu 


Seidenberg refined 


Subresultant-based 


1 


.05 


.02 


.02 


2 


.52 


.45 


.36 


3 


1.01 


.69 


.66 


4 


.43 


.11 


.07 


5 


.93 


.66 


2.43 


6 


2.03 


11.73 


113.06 


7 


.63 


.49 


.41 


8 


9.53 


1.15 


2.16 


9 


17.19 


4.92 


4.8 


10 


>2000 


>2000 


>2000 


11 


3.81 


5.06 


*** 


12 


7.58 


2.38 


7.05 


13 


17.11 


5.62 


8.35 


14 


33.54 


16.64 


19.68 


15 


29.47 


5.56 


4.18 


16 


*** 


*** 


*** 


17 


18.35 


1.54 


.72 


18 


>2000 


>2000 


>2000 


19 


*** 


44.39 


31.74 


20 


13.91 


6.96 


4.45 


21 


>2000 


>2000 


>2000 


22 


>2000 


>2000 


>2000 



Table 1. Irreducible variety decomposition for 22 examples 

From Table 1 one sees that irreducible variety decomposition using algo- 
rithms based on Seidenberg’s elimination idea and s.r.s. is faster than that using 
characteristic sets for most of the complete test cases. Example 6 is one of the 
few notable examples for which our algorithms are slower than Ritt-Wu’s for 
the irreducible zero decomposition as already pointed out in [18]. The differ- 
ence of computing times with the three zero decomposition algorithms is often 
not significant because the involved saturation bases computation may take a 
large part of the total computing time. It is also possible that computing the 
saturation bases from the triangular sets requires more time than that from the 
ascending sets. This actually happens with Example 9 (for which the irreducible 
zero decomposition by either algorithm does not take much time) in Maple V.3. 

As we have explained previously, there are two different ways for computing 
the prime/saturation bases of triangular sets. There are also a few variants for 
the algorithms which may affect the computation. Let T = [Ti,...,Tj.] be an 
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irreducible triangular set in /C[a;], Fi, . . . , be all the distinct irreducible factors 
(over JC) of 111=1 F = Fi---Ft. We consider the following four 

variants for the construction of a prime basis V of T: 

VI. Compute the Grobner basis G of TU{zF— 1} with respect to plex^ with xi -< 
■ ■ ■ -< Xn ~< z and take G H /C[a;] for V, where z is a new indeterminate; 

V2. Compute the Grobner basis G of T U {ziFi — 1 , . .. ,Zt Ft — 1} with respect 
to plex with xi < Xn zi Zt and take G C JC[x\ for V, 

where Z\, ... ,Zt are new indeterminates; 

V3. Compute the generating sets Vfe for Ideal(T) : F\ = 1, 2, . . until Ideal(T) : 
F^ = Ideal(T) : and take the last Vfe for V, where the quotient ideals 

are computed using plex Grobner bases with x\ ^ ^ Xn ^ z according 

to what is explained in the paragraph after Lemma 2; 

V4. Normalize T to obtain an equivalent triangular set T (see [21, Sect. 5.2] and 
Example 3) and then compute a prime basis of T according to V2 above. 

The following table shows the computational effect of the four variants. 



Ex 


VI 


V2 


V3 


V4 


3 


.41 


.46 


1.51 


.24 


9 


11.18 


11.98 


63.11 


5.18 


13 


3.09 


— 


64.5 


2.43 


14 


16.19 


11.22 


246.62 


12.24 


18 


>2000 


>2000 


>2000 


>2000 


19 


109.64 


23.45 


>2000 


>2000 


20 


2.6 


— 


23.49 


.79 


21 


>2000 


>2000 


>2000 


>2000 


22 


>2000 


>2000 


>2000 


>2000 



Table 2. Variants for prime bases computation 



The dash — in Table 2 denotes the cases in which the variant V2 coincides 
with VI (i.e., t = 1). It occurs that the method V3 performs worst for these 
examples. For the two better variants V2 and V4, it is not evident which one 
is superior to the other. An additional message is that, in Maple V.3 running 
on an Alpha station 600 at MMRC, Academia Sinica, the Grobner bases were 
computed in 602.62 and 1371.7 CPU seconds respectively with VI and V2 for 
Example 18, and in 201.68 CPU seconds with V4 for Example 21. This makes 
it even difficult to predetermine which of the three variants VI, V2 and V4 to 
take for a concrete problem. 

As the computation of saturation bases represents a major time-consuming 
step, it is of interest to try other Grobner bases implementations for our test ex- 

^ We have tried the new Groebner package with the elimination term order lexdeg in 
Maple V.5. For most of our examples, notably Ex 9 and Ex 19, the computation 
using lexdeg is much slower than that using plex. 
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amples. We did so using the Grobner basis packages in Reduce 3.4 and Risa/Asir^ 
in comparison with grobner in Maple V.2, and the Gb package of J.-G. Faugere in 
comparison with Groebner in Maple V.5. For Example 14, 14 Grobner bases have 
to be computed and the variety is decomposed into 6 irreducible components. 
The detailed timings obtained on one and the same machine at ISIS in August 
1993 for Maple V.2, Reduce 3.4 and Risa/Asir, and on the above-mentioned i686 
at LIP6 for Maple V.5 and Gb 3.1156 are exhibited below. 



No 


1 


2 


3 


4 


5 


6 


7 


Maple V.2 


7.32 


11.55 


5.12 


5.83 


23.08 


3.43 


27.17 


Reduce 3.4 


.58 


1.92 


.51 


.73 


1.72 


.32 


9.08 


Risa/Asir 


1.5 


2.91 


.78 


1.4 


4.89 


.5 


7.74 


Maple V.5 


.74 


.59 


.37 


.53 


3.08 


.22 


.77 


Gb 3.1156 


.03 


0 


.01 


0 


.09 


.01 


.02 


No 


8 


9 


10 


11 


12 


13 


14 


Maple V.2 


4.75 


3.03 


3.07 


121.93 


4.08 


2.77 


3698.38 


Reduce 3.4 


.38 


.23 


.52 


14.65 


.32 


.38 


944.18 


Risa/Asir 


.75 


.56 


.69 


14.97 


.74 


.74 


243.76 


Maple V.5 


.32 


.19 


.17 


6.31 


.22 


.23 


4.31 


Gb 3.1156 


.01 


.02 


.01 


.05 


0 


.01 


.02 



Table 3. Grobner bases computation for Example 14 

The remarkable difference of timings shown in Table 3 was somewhat to our 
surprise, and indicates the inefficiency of the Grobner basis packages in Maple 
and the interest in implementing our decomposition algorithms using another 
computer algebra system or a low-level programming language for effective ap- 
plications. We are still working with Maple mainly because of our experience 
and the existence of an elimination library we have developed. Although Gb 
took little time to compute the 14 Grobner bases for Example 14, it did not 
succeed in computing the two Grobner bases needed for Example 18. We are 
going to try Faugere’s FGb, a new version of Gb, and other implementations of 
Grobner bases such as those in Magma and GoGoA for large examples. 



Techniques for Practical EfRciency 

The algorithms for variety decomposition discussed in the previous sections em- 
ploy several time-consuming subalgorithms. Improvements on each subalgorithm 
have positive consequence for the overall performance. Roughly speaking, the 

^ This is an experimental system developed by the computer algebra group at ISIS, 
Fujitsu Laboratories Limited, Japan. M. Noro from this group informed the author 
that the computing time with Risa/Asir in Table 3 may be much reduced by setting 
some switches on. Moreover, by using another C implementation of a Grobner basis 
package in Risa/Asir, the 14 Grobner bases can be computed in 3 GPU seconds. 
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main algorithms consist of two essentially independent steps: one computes the 
zero decomposition and the other constructs the saturation bases. If the zero 
decomposition cannot be completed, the variety decomposition fails. When the 
first step is successful, we come to the second step of constructing the saturation 
bases. Two different algorithms with variants for this step have been explained 
in Section 3. Our experiments have shown the dependence of their efficiency 
on problems and the potential of using other Grobner packages. The construc- 
tion of saturation bases is unnecessary in some cases, for instance, when in (1) 
e = 1 and Ti is irreducible (and thus the given variety is irreducible), or all 
are of dimension 0, or all the initials of the polynomials in are constants. 
This happened for about half of the cases in our test. Construction of the prime 
bases requires expensive Grobner bases computation; for this we simply use the 
grobner/Groebner packages in Maple without modification. For four of the test 
examples we failed in computing the Grobner bases within 2000 CPU seconds 
for each or because of “object too large.” 

For zero decomposition, we continue studying algorithmic and implementa- 
tional strategies to speed up each substep. Some of the strategies and techniques 
have been discussed in the papers where the zero decomposition algorithms are 
proposed and experimented. In what follows, we mention a few removing tech- 
niques, some of which have not been given before. 

Early Removal of Redundant Components. The zero decomposition me- 
thods reviewed in Section 2 may produce many redundant branches/components. 
It is always desirable to remove such redundant components as early as possible. 
In fact, there are a number of techniques for doing so, see [3,7,15,17,18] for 
example. One of the most effective techniques suggested by Chou and Gao [3,7] 
for variety decomposition is the application of the affine dimension theorem 
(i.e., Ti is redundant if jT^j > jPj = s, see Lemma 4) when a W-characteristic 
series of P is computed. For being more efficient, we have used this technique 
dynamically (i.e., whether the final T^ will contain more than s polynomials 
is examined during the computation of T^) in the implementation of our zero 
decomposition algorithms based Seidenberg’s idea and s.r.s. This has taken the 
advantage of top-down elimination: for a triangular system [T, U], it is known 
that either ]Tj > s or Zero(T/U) = 0 as soon as the last s polynomials of T have 
been obtained and at least one more polynomial of T can be obtained (it is the 
case when there still exist polynomials for elimination to produce T). In other 
words, a redundant triangular system may be removed before it is completely 
computed. This technique is particularly useful for removing such redundant 
triangular systems [T, U] of lower dimension for which the first few polynomials 
in T are large and difficult to compute. 

For another technique, consider the last step U3 of UnmVarDec which aims at 
removing some redundant components. This step does not appear in IrrVarDec 
due to the following observation. Since T taken from S' in step 12.1 has the 
highest dimension, the remaining triangular sets T* in are all those whose di- 
mensions are not higher than that of T; thus Zero(sat(T*)) are all the candidates 
of subvarieties of Zero(sat(T)). After the saturation basis G of T is computed in 
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step 12.2, we immediately compute prem(G, T*) in step 12.3, and remove such T* 
when prem(G, T*) = {0} is verified. Therefore, in the case where some T* is suc- 
cessfully removed, its saturation basis must not have been computed. 

This simple technique deducts the time for computing the prime bases that 
are redundant, and does not lead to any additional cost. The redundancy occurs 
often for examples from geometry theorem proving where the corresponding va- 
rieties are of higher dimension. We applied the technique to Example 14 and 
got a further surprise. Among the 14 Grobner bases, 8 (including the one whose 
computation takes much time) correspond to redundant subvarieties and need 
not be computed according to this technique. The previous total decomposition 
time 2663.2 CPU seconds (with Maple V.2 on a SUN SparcServer 690/51) is 
reduced to 113.5 seconds. In other words, the technique speeds up the decom- 
position by a factor of 23. 

However, the above-explained technique cannot be employed when the pseu- 
do-division verification is replaced by the normal form reduction. 

Removal of Computed Subvarieties. Now we describe a technique to re- 
move a subvariety from the given variety as soon as the subvariety is computed. 
This is similar to the case of polynomial factorization in which an irreducible 
factor can readily be removed from the polynomial being factorized when the 
factor is found. However, the removal of subvarieties appears much more dif- 
ficult computationally. The removing technique can be incorporated into the 
decomposition algorithms according to the following theorem. 

Theorem 6. Let P and Q = {Ti, . . . , Ft} be two polynomial sets in IC[x] with 
Zero(Q) C Zero(P) and 3 be the ideal generated by 

P U -I- • • • -I- — 1} in K,[x,z] (7) 

or by 

PU{ 2 ;iFi-| \-ztFt-l} in K.[x, zi, . . . , Zt], (8) 

where z, z\, . . . , Zt are new indeterminates. Then 



Zero{P) = Zero{Q) U Zero{3 n /C[a:]). 

Proof. Consider the case in which 

3 = Ideal(P U {zFi H h z*Ft - 1}). 

Let X e Zero(P). For any P € U n /C[a:], there exists a polynomial Q S IC[x,z] 
such that 

P — Q{zFi -I- • • • -I- z*Ft — 1) G Ideal(P) C K.[x,z]. 

Hence 

P{x) = Q{x,z)[zFi{x) H h z^Ft{x) - 1] (9) 

for arbitrary z. Suppose that x ^ Zero(Q). Then there exists some j such 

that Fj{x) ^ 0. So there is a z S F such that zFi{x) -I- ■ ■ ■ -I- z*Ft{x) — 1 = 0. 

Plunging z into (9), we get P{x) = 0. Therefore, Zero(P) C Zero(Q) U Zero(U n 

K.[x]). 
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To show the opposite, let x G Zeio{3 n IC[x]). Obviously, for z G K.{x, z] and 
any P G P 

Zero(P) C Zero(P(zPi H h z^Ft)). 

By Hilbert’s Nullstellensatz, there exists an exponent q > 0 such that 



P«(zFi + --- + z‘Ft)« G Ideal(P) cJC[x,z]. 



It follows that 

+ P9[(zPi + • • • + + (zPi + • • • + z^FtY-'^ + • • • + 1] 

• (zFi + • • • + z*Pt - 1) G Ideal(P) C JC[x, z], 

so that P'J G 3. Since P does not involve z, G 3 f]lC[x]. Hence, P'J(S) = 0 
and thus P{x) = 0. This proves that Zero(3 n K.[x]) C Zero(P). 

The case in which 3 = Ideal(PU{ziPi + • • • + ZtPt — 1}) is proved analogously, 
observing that if Pi(S), . . . , Ft{x) are not all 0, then there exist zi, . . . , Zt such 
that ziFi{x) + • • • + ZtFt{x) — 1 = 0, and P‘^{z\Fi + • • • + ZtFtY G Ideal(P) C 
JC[x, zi, . . . , Zt] for some integer g > 0. 

This theorem suggests a way to remove any subvariety Zero(Q) from the 
given variety Zero(P) by determining a finite basis HI for the ideal 3 n JC[x]. 
The latter can be done, for instance, by computing a Grobner basis of (7) or 
of (8) with respect to plex determined by Xj -< z or Xj -< zi together with 
its elimination property. Thus, decomposing Zero(P) is reduced to decomposing 
Zero(Q) and Zero(E[). The author has tested this technique. Nevertheless, the 
Grobner bases computation in this case is too inefficient and we had no gain 
from the experiments. One can make use of the technique only when a more 
effective procedure for determining the finite bases is available. 

In fact, the removal of Zero(Q) from Zero(P) corresponds to computing the 
quotient Ideal(P) : Ideal(Q). The latter can be done by a possibly more efficient 
algorithm described in [5, pp. 193-195]. 



Applications 

Decomposing given algebraic varieties into irreducible or unmixed components 
is a fundamental task in classical algebraic geometry and has various applica- 
tions in modern geometry engineering. Among such applications we can mention 
two: one in computer-aided geometric design where the considered geometric 
objects are desired to be decomposed into simpler subobjects and the other in 
automated geometry theorem proving where the configuration of the geometric 
hypotheses needs to be decomposed in order to determine for which components 
the geometric theorem holds true. 

An irreducible decomposition of the algebraic variety defined by P corre- 
sponds to a prime decomposition of the radical of the ideal 3 generated by P. 
From the latter a primary decomposition of 3 can be constructed, see [6,13] for 
details. This points out an application of variety decomposition to computational 
polynomial ideal theory. 
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In various mathematical problems such as the qualitative study of differen- 
tial equations (where Example 3 comes), necessary and sufficient conditions are 
often established and given as polynomial equalities in different ways and by dif- 
ferent researchers. Variety decomposition provides an effective way to examine 
the relationships such as equivalence and containment among different sets of 
conditions. 
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Appendix 

List of Examples 17 22 

In the following list, P is the polynomial set that defines the algebraic variety to 
be decomposed and uj is the variable ordering used for the computation. These 
examples were chosen quite randomly yet with moderate difficulty so that the 
algorithms under discussion can be tested. 

Example 17. Let P = — x — y,x^ — z, — t} with lo: t ri z ri y ri x. 

The variety defined by P is irreducible and is of dimension 1 in the 4-dimensional 
space. The set of polynomials is taken from a paper by C. Traverse and L. Donati 
(Proc. ISSAC ’89, pp. 192-198). 

Example 18. Let P = {Pi, . . . , P 4 }, where 

Pi = (a: - u)2 + {y- v)"^ - 1, 

P2 = — u^, 

P 3 = 2v{x — It) -I- 3u^{y — v), 

P 4 = {3wu^ — l){2wv — 1) 



Decomposing Algebraic Varieties 205 



with UJ-. x<y^u^v^w. The irreducible decomposition of the variety defined 
by P could not be completed within 2000 CPU seconds. The set of polynomials 
was communicated by P. Vermeer (Department of Computer Science, Purdue 
Univ., USA). 

Example 19. Let P = {Pi,P 2 ,P 3 }, where 

Pi = {xi - yi)"^ + {x 2 - y 2 f - {x[ - y'l)^ - (4 - y 2 )^ 

P2 = {x[ - Cl)^ + {x'2 - C 2 )^ - (Xl - Zi)"^ - {X2 - Z2Y, 

P3 = {y{ - Cir + {y'2 - C2? - [yi - - {y2 - Z2Y 

with LO: X\ < X 2 < yi < y 2 Zi < Z 2 < x'l A x '2 -< y'l < y '2 < Y ^ ^ 2 - 
The variety defined by P may be decomposed into two irreducible components. 
The set of polynomials is taken from a paper by J. Pfalzeraf (Ann. Math. Artif. 
Intell. 13: 173-193). 

Example 20. Let P = {Pi, . . . , Pq}, where 

Pi = 2xi - 1, 

P 2 = xl + xl - 1, 

P 3 = xl + xl - ul - u\, 

P 4 = 2 u 2 X 4 + 2 uiX 3 — ul — ul, 

P 5 = xl + xl — 2x5 — ul ~ "^1 P 2 ui, 

Pq = 2 u 2 Xq + 2 u\X 5 — 2 x^ — ul — ul + 1 

with Lu\ Ul ^ U 2 ^ Xl ^ ^ xq. The variety defined by P may be decomposed 

into 9 irreducible components. The set of polynomials comes from an algebraic 
formulation of Steiner’s theorem given in a paper by the author (Proc. CADE-12, 
LNAI 814, pp. 386-400). 

Example 21. Let P = {Pi, . . . , Pq}, where 

Pi = (3 m{ — 1 ) 2:2 -I- ui{ul — 3)a;i — ui(3it{ — 1), 

P 2 = (3^2 — 1 ) 2:2 + U 2 {ul — 3 ) 2:1 — U2(3m2 — 1 ), 

P 3 = -2:iPi2:4 -I- 3{F2X3 + XiP3)xl 

+ 3[2:iPi2:| — 2 ( 2:2 — ^ 22:2 — 2mi2:2 — U2xlx2 — 2uixlx2 + Fa)x3 
— Xi(xl + 2x1x1 + ulu2X2 — P4)]2:4 
— P22:3 — 3xiP32:| -I- 3(2:| -I- 22:^2:2 + xfx2 — ulxlx2 — Fq)x 3 
— Xi{2uiU2xl + ulxl — 2ulu2xl + 2uiU2xlx2 — Fq), 

P 4 = —(X2 + Ui2:i — 141)2:4 -I- (ui2:2 — 2:1 — ul)x 3 + U1X2 + ulxi — ul, 

P5 = —(X2 + U2X1 — U2)Xq + {U2X2 — Xl — uI)Xq + U2X2 + ulxi — ul, 

Pq = —{X 1 G 1 X 4 — G 2 X 3 — XiG 3 )xq + {G 2 X 4 + X 1 G 1 X 3 — G 4 )xq 

+ X 1 G 3 X 4 — G 4 X 3 — Xi{u 2 xl + Uixl — 2 U 1 U 2 X 2 + U 2 xl + Uixl) 
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with 

Fi = 3x2 ~ 2 m 2 X 2 — 4uiX2 — x\ + 2 uiU 2 + u\, 

F2 = X2 — U2X2 — 2 mix| — 3 x^X 2 + 2U1U2X2 + wfx 2 + U2X1 + 2 uiXi — u\u 2 , 
F 3 = 2x| — U 2 X 2 — 2uix| + 2 x\x 2 — U 2 x\ — 2 u\x\ + u\u 2 ^ 

F4 = 2u\U2x\ + u\x\ — U1U2X2 — x\+ 2 u\U2x\ + u\x\^ 

F5 = U2X2 + 2mix| — 2M1M2X2 — U1X2 + 2m2X^x| + 4 uiX^X 2 + u\u 2 x\ 

— 2 u\U 2 x\x 2 — 2 u\xiX 2 + U 2 xf + 2 uix\ — u\u 2 x\, 

G\ = 2X2 — U2 — Ui, 

G2 = x\ — U2X2 — U1X2 — xf + U1U2, 

G 3 = x| + xf — U1U2, 

G 4 = x| — U2X2 — U1X2 + xfx2 + U1U2X2 — U2x\ — Uixl 

and w: ui ^ M 2 ^ xi ^ ^ xq. The variety defined by P may be decom- 

posed into 12 irreducible components (in Maple V.3 on an Alpha station 600) 
when the technique of normalization is used. The set of polynomials comes from 
an algebraic formulation of Morley’s theorem given in a paper by the author 
(Proc. CADE-12, LNAl 814, pp. 386-400). 

Example 22. Let P = {Pi, . . . , P 7 }, where 

Pi = 4Mfw2x{ — ( 2 U 2 M 3 -I- 2u\u 2 — E){2u\u\u'i — 2m| -|- u\E)^ 

P 2 = 4m2(2m^xi — 2u\u3 — A)x| -I- G^[4/x2 — 2u\D^x\ — P], 

P 3 = —Au 2 { 2 .u\x\ + 2u\u‘i + A)x| -I- G^[ 4 /x 3 -I- 2u^P^xi — P], 

P 4 = U 1 U 2 X 4 — u\u 2 + 1, 

P 5 = M 1 U 2 X 5 +u\ — u\, 

Pe = 2 u\[u\{x^ + X4) — GD]xq — u\E{x^ + X4) + BGD, 

P 7 = Jx 7 — [ 2 ufu 2 X 5 — {u 2 — u\){u\u^ + l)]xe + u\Ex 5 — U 2 + u\ 



with 

I = {u\ — — 1), P = 2[u^U2 — 2{B + uf)u 2 + uf\u 3 — AD^\ 

A = — 1, B = u\ + 1, G = — 1, P = U 2 + 1, P = M 2 + 1 

and UJ-. u\ ^ U 2 < U 3 < x\ ^ XT- The irreducible decomposition of the 

variety defined by P could not be completed within 2000 CPU seconds. The set of 
polynomials comes from an algebraic formulation of Thebault-Taylor’s theorem 
given in Yang and Zhang (1994). 
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Abstract. Getting accurate construction of tridimensional CAD mod- 
els is a field of great importance: with the increasing complexity of the 
models that modeling tools can manage nowadays, it becomes more and 
more necessary to construct geometrically accurate descriptions. Maybe 
the most promising technique, because of its full generality, is the use of 
automatic geometric tools: these can be used for checking the geomet- 
rical coherency and discovering geometrical properties of the model. In 
this paper, we describe an automatic method for constructing the model 
of a given geometrical configuration and for discovering the theorems of 
this configuration. This approach motivated by 3D modeling problems 
is based on characteristic set techniques and generic polynomials in the 
bracket algebra. 



1 Introduction 

The construction of tridimensional CAD models is of growing importance: with 
the increase of power of the computers and networks, it is now a common ex- 
perience to navigate within a virtual 3D world. One of the biggest challenges 
to obtain the best navigation is to get models that are both photo-realistic and 
accurate. One particular technique, derived from computer vision, makes possi- 
ble the construction of a 3D model of a scene, starting from simple pictures of 
it [10,9] (Fig. 1). 

In order to achieve this result, several stages are necessary: first, special image 
features (i.e. corner points) must be extracted from the pictures and matched 
across the various views. These matched features are then used to calibrate the 
cameras, which consists in finding the parameters defining their states (such as 
their focal distances) and positions in the Euclidean space. Once the cameras are 
calibrated, reconstructing a 3D feature is just a matter of triangulation provided 
that the object is visible in at least two pictures. Using such a scheme, it is quite 
easy to obtain a realistic 3D model since the photometric information of the 
pictures can be used to augment the pure geometric model. 

However, in most cases, these techniques are based only on the viewing ge- 
ometry (i.e. the geometry of light rays captured by the cameras). Since most 
human made objects (e.g. buildings, tools, . . . ) are usually geometrically struc- 
tured (coplanar or aligned points, orthogonal or parallel lines, . . . ), a very rich 
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Fig. 1. The 3D model 

source of information is thus ignored, whereas it could be used to “stabilize” re- 
sults obtained from the noisy data. So far a priori knowledge of the scene to be 
reconstructed is used only through CAD models describing it in terms of geomet- 
rical primitive blocks such as cubes, pyramids, cones, spheres, . . . Each primitive 
block being parameterized, it is then possible to “optimize” the 3D model so as 
to obtain the best 3D structure predicting the observed images [9,16]. In many 
cases, creating such CAD models is a tedious task and a weaker description of 
the geometric structure (in terms of raw geometrical properties) would be more 
convenient. 

Let us illustrate our claim by a picture (Fig. 2): the scene is a district of a 
city. As it can be seen, buildings are never completely visible so that it would 
be difficult to model those as polyhedrons. On the contrary, it is fairly easy to 
associate to each frontage a polygon and to each edge a segment, as well as 
polyhedrons to some details. These primitives can then be constrained using 
relations such as orthogonality (e.g. of frontages), parallelism and angle (for 
windows). Of course, not all properties need to be introduced one by one and 
higher level of abstraction can be used: for example, rectangles can be associated 
to each window. 

The problem which then arises is organizing these descriptions in such a way 
that it can be used by reconstruction algorithms. If we want a fairly generic 
system, this definitely requires some form of geometrical reasoning. The poten- 
tial importance of using such reasoning methods for computer vision has been 
recognized for many years [12], as it is natural to think that geometry plays a 
significant role in the analysis and comprehension of at least human made scenes. 
Taking account of it would greatly improve the quality and the generality of the 
current systems. However, the complexity of the usual scenes makes the use of 
current tools difficult. Consequently, those have only been used so far in simple 
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Fig. 2. Geometrical features in a picture of buildings 



limited situations, e.g. to constrain the position of the camera directly from the 
observations of certain known geometric patterns [13,22]. 

In this paper, we intend to show how some techniques inspired by automated 
geometric reasoning can be used as a way to structure the a priori geometrical 
knowledge of a given 3D situation in such a way that it can be used to de- 
rive better 3D models when using computer vision techniques. The next section 
describes the computer vision problem we are interested in and presents the 
geometrical tools that are needed in order to solve it. Section 3 then analyzes 
the inherent difficulties of using the tools that have been developed within the 
framework of automatic theorem proving. Section 4 proposes the two new tools 
that have been developed to cope with our problem. Finally, Section 5 shows 
results obtained with those tools. 

2 Building Geometrically Constrained 3D Models from 
Images 

3D models such as the one shown in Fig. 1 are built uniquely from a set of images 
(Fig. 3). As sketched briefly in the introduction, three main steps are used in 
order to achieve this results^: 



^ The whole process is depicted with point primitives for the sake of simplicity, it is 
however more general and can be used with other kinds of primitives such as lines. 
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Fig. 3. Different views of “place des arcades” , Valbonne 



— First, characteristic points are extracted from each image. 

~ Those points are then matched across the images: points corresponding to 
different views of a same 3D point are associated. 

— Point matches are then used to establish a set of 3 x 3 matrices known as 
fundamental matrices that encompass the projective structure of the rigid 
scene for each two images. The fundamental matrices are then combined in 
order to establish the Euclidian positions and properties of each of the cam- 
eras. Note that this whole process is based only on the weak assumption that 
the scene is Euclidian and rigid. Once this geometry has been established, 
it is easy to obtain a 3D point for each of the point matches by intersecting 
the corresponding rays. An initial 3D model has thus been obtained. 

However, this scheme still has to face with many problems: 
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— Extracted points are always corrupted with noise. This noise has many 
sources: electronic or optic effects, sampling, discretization and even biases 
when automatic feature extraction is used. 

— Incorrect matches can occur: even though a certain number of such matches 
can be (and are) detected during the fundamental matrix computation, some 
of those may still influence the final result of the algorithm. 

— Computing the camera positions and properties from fundamental matrices 
is inherently non-linear. In difficult situations, this gives rise to imstabilities 
that perturb the final result. 

Of course these effects are maximized if using fully automatic methods and 
the trend has been to use manual or semi-automatic methods instead. Never- 
theless, such inaccuracies always remain. This is all the more annoying that the 
process sketched above is not directly optimizing the quality of the 3D model 
with respect to the observed data. Consequently, one additional step is usually 
performed to complete the above process. This step called bundle adjustment re- 
fines the 3D structure and cameras parameters in such a way that the quadratic 
errors between observed and predicted points in the images are minimized. To 
formalize things better, let us call Mj a given 3D points and its observed 
image obtained with the i-th camera Pi. Denoting by Pi(Mj) the predicted im- 
age point corresponding to m^, and assuming that there are N cameras and m 
3D points, the bundle adjustment process minimizes the following criterion: 



where c?(mi,m 2 ) represents the distance between mi and m 2 . Note that since 
the 3D points Mj are computed from the data, the parameters over which the 
minimization is carried on are just the Pi,i = In practise, this min- 

imization is carried on by doing alternating sequences of 3D reconstructions 
(fixing the camera parameters) and of camera calibrations (fixing the 3D point 
positions). Figures 1 and 4, show the results obtained using the data of Fig. 3. 
The 3D-model is shown from different points of view. With the orthographic 
top view, we can see that the roofs are not rectangular and that the frontages 
are not planar. These models are not verifying exactly some of the geometrical 
properties of the scene that would be expected [2,1]. 

2.1 Incorporating Constraints in the Reconstruction Process 

Of course, one way to improve this is to use the knowledge of geometrical prop- 
erties of the scene: this would both stabilize the calibration and give models 
which respect the 3D geometry of the scene. To do so, we consider an enriched 
input consisting of: 

— A set P of 3D primitives which consists of the reconstructed 3D points 
that have been used so far are used but also of logical grouping of those 
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Fig. 4. A wire frame view of the 3D-model 



to introduce higher-level kinds of primitives such as 3D planes or lines. As 
before, each 3D point is attached to a set of matched image points. 

— A set C of geometric constraints attached to these primitives (incidences, 
angles, . . . ). 

This input has to be provided by the user. Using the image points associated 
with P, we compute a set of camera positions and properties as well as an initial 
3D model as above. But we now want to introduce the known constraints C in 
the bundle adjustment process. This can be achieved in two different ways: 



Using a Parameterization. One technique would be to exhibit somehow a 
parameterization p of the scene or more precisely of the set of 3D primitives P 
obeying to the constraints C. Such a parameterization can usually be defined by a 
subset M of the 3D points of P and by a set of expressions Mi{p) derived from C 
defining all the other 3D points of P in terms of those of M. By construction, P 
is then completely defined in such a way that the constraints C are satisfied. 
Using such a tool, it is easy to modify the bundle adjustment process to take 
account of the constraints. It is sufficient to just modify the 3D reconstruction 
part by: 

— Reconstruct only the points of Ai from the image data. 

— Deduce the other 3D points using the rules Mi{p). 
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The cost function that is minimized then becomes: 

N m 
i=i j=i 

This scheme has been implemented. The result obtained for the above scene is 
shown in Fig. 7. As the points of A4 play a special role in the method, problems 
can be expected if those points are very erroneous due to image measurements as 
those errors will be propagated to the complete model. This is one weakness of 
the method. Its major advantages are its simplicity (once the parameterization 
is computed) and the correctness of its results as it is guaranteed that the 3D 
constraints C are satisfied at each time instant. 



Using an Optimization Method. An alternative to the previous method is 
to rely on optimization techniques to incrementally improve the geometric 3D 
structure at each reconstruction step. In its principle, implementing this method 
is not very difficult. Each constraint of C can be expressed as an algebraic 
expression that must cancel out. Minimizing the residuals of those expression 
with respect to the 3D point coordinates is just a matter of gradient descent. 

As such, this method suffers from problems similar to those arising in the 
method presented in the previous section: although all the 3D primitives seem 
to play a similar role, this is still not true as geometrical constraints explicitly 
written in C have a particular status with respect to all the various geometric 
properties of the figure that are consequences of those. The net effect of this 
is that again, some primitives will play a particular role. To understand better 
this problem, let us consider a set of lines li,l 2 , ■ ■ ■ ,lk- We want to state that all 
these lines are parallel by writing that Ij is parallel to li for j = 2, ... ,k. In this 
scheme, li is clearly granted with a central role that is completely artificial. If it 
turns up that h is precisely a primitive that has been reconstructed erroneously 
whereas h, ■■ - Ik are recovered correctly, then this error will tend to propagate 
to all the other lines. This effect would not have appeared if all the parallelism 
constraints had been explicitly stated in C. 

This suggests that it might be extremely important to be able to generate 
all the consequences of the given set of constraints C. Using all those in the 
minimization process would reduce the risk of propagating errors inconsiderately. 

In the following sections, two techniques are presented. The first one attempts 
to compute a parameterization p given P and C and has been used with the 
program totalcalib to produce the result shown in Fig. 7. A comparison with 
the result given in Fig. 4 clearly shows the improvement of the geometric 3D 
structure. The second technique shows how consequences of C can be system- 
atically computed. The modified bundle adjustment of Section 2.1 has not yet 
been implemented, but, as explained above, using this technique should greatly 
improve the stability of such a method. 
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3 Drawbacks of the Classical Geometric Reasoning Tools 
in Computer Vision 

In order to do automatic geometrical reasoning, it is necessary to translate the 
geometry into an algebraic form. Nowadays, two kinds of techniques are opposed: 
one uses a reference frame (a coordinate system), the other, on the contrary, is 
free from the choice of a reference frame, using more intrinsic quantities, i.e. 
invariants. We briefly summarize these approaches and evaluate them in the 
context of computer vision problems. 

3.1 Using a Reference Frame 

Here, we give short descriptions of some provers which use a coordinate system 
to formulate the proofs. We refer to [21] for a more complete treatment of the 
algebraic methods that are used. The most well-known techniques for theorem 
proving are probably Wu’s method [4,20,19] and Grobner basis method [6,23]. 
In these methods, the geometrical objects are defined by several parameters 
(for a point, its coordinates; for a line in the planar case, the 3 coefficients 
of its equation; . . .). The geometrical properties are represented by polynomial 
equations into the parameters of the objects. Roughly speaking, proving a geo- 
metric theorem is equivalent to showing that a polynomial vanishes on a certain 
manifold, which corresponds to the generic configurations of the theorem. More 
precisely [25,26,6], this “reduces” to show that the polynomial representing the 
conclusion of a theorem is a member of the radical ideal generated by the poly- 
nomials representing its hypotheses divided (or saturated) by the ideal generated 
by its degeneracy conditions. 

Characteristic set methods transform the initial system of equations into 
one (or several) triangular set(s), by which any valid geometric property of the 
configuration can be reduced to zero. These triangular sets are obtained as fixed 
points of an autoreduction process of the initial equations. 

Grobner basis methods compute a basis of the initial ideal saturated by the 
degenerate conditions. This also allows the reduction to zero of any polynomial 
which corresponds to a geometric property of the configuration. 

3.2 Using Geometric Invariants 

Another class of methods uses invariant calculus and are of particular interest 
in computer vision (for these invariants quantities may be computed directly in 
the images) . The theorem provers of this family use intrinsic computations, that 
are usually more compact and easier to interpret in natural language. We briefly 
review some of these provers (see also [21]). 

In the prover Cinderella [7,17], the geometrical properties are represented 
by polynomials in brackets (more precisely by binomials of determinants) , many 
geometric properties can be written in this way. In this case, the reduction of the 
conclusion is performed by simple linear algebra on integer matrices representing 
the exponents of the variables involved in these binomials. 
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In the works of [5,14], hypotheses are also translated into polynomials of 
the bracket algebra (extended in GEX [5] by other primitives like the area of 
quadrilateral, inner products of vectors, angles). The methods are essentially 
based on rewriting rules on points. In the system GEO [14], only collinearity 
relations are considered. Rewriting rules on points are constructed formally in 
the exterior algebra, taking into account the generic degree of freedom of each 
point. 

Clifford algebras which are exterior algebras extended with a quadratic form 
and a related product (see [8]) have also been used [18,22]. Theorems are proved 
by reduction modulo algebraic expression associated with the hypotheses. 

Other primitives like distance between points have also been considered. 
In [11] for instance, the geometrical properties are translated into polynomi- 
als in the distances between points, and straightening laws are applied to get a 
normal form and test the validity of some theorems. 

3.3 A Comparison of These Methods in Our Context 

We come back now to our problem of 3D modelisation and compare first the 
existing provers in this context. 

Wu’s and Wang’s methods are ordering the points and using a system of co- 
ordinates, according the theorem to be proved. The method has the advantage 
to deal uniformly with different kinds of geometric constraints (collinearity, or- 
thogonality, angles, . . . ). The interpretation of the polynomial calculus in terms 
of geometric relations is not obvious. Moreover, as experimented on the scene 
“place des arcades” (Fig. 1), the computation is very expensive on problems 
involving many points or objects. In this scene, 102 parameters and 101 equa- 
tions are necessary to describe the geometry of the scene. With such a number 
of parameters, it is not possible to use characteristic set or Grobner basis tech- 
niques, for we have an explosion of the computation time and memory resource 
during the computation of these polynomial sets. Notice that this explosion is 
essentially due to the inter-reduction of the polynomials. 

The Richter-Gebert Grapo method has the advantage to reduce the proof 
to simple linear algebra on integer matrices. Usually the proofs are very short. 
However, there is no way to assert that a property is generically false (we have 
only sufficient conditions) . Gombining both points, lines, planes in a same com- 
putation seems to be difficult. The extension to the 3D case is possible but 
expensive. 

The method of Ghou-Gao-Zhang has the advantage to produce fast and sim- 
ple proofs. It requires also a constructive order on the points and it has some 
difficulties with circles (quadratic rewriting rules instead of linear ones for lines 
and points). Its generalization to the 3D case has not yet been really investigated. 

In the context of 3D modelisation, none of these approaches yield a satisfac- 
tory answer. The main difficulties that we encounter are: 
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~ The definition of a natural constructive order on the objects, 

~ The construction of reductor sets that can handle many geometric objects 

— The automatic generation of theorems. 

We describe our treatment of these points in the next sections. 

4 Application of Theorem Proving to Computer Vision 

4.1 Building a Constructive Order 

Our configurations are defined by features (points, lines, planes, . . . ) and the 
constraints between them without any information on their construction or their 
degree of freedom. We describe here an automatic tool, which finds the degree 
of freedom for all the features and which sorts the features so that we can build 
the configuration. Such an order will be called constructive order. We start with 
some definitions: 

— The degree of freedom of an object is the dimension of the manifold to which 
this object belongs. In other words, it is the number of parameters of the 
object which can be instantiated randomly. 

— A generic component of a configuration is a component where all the objects 
are distinct and the dimension of the intersection of two objects is the largest 
possible, taking into account all the constraints on the two objects. 

— An object will be called constructible iff its degree of freedom is equal to 
zero. 

— A constraint will be called active iff only one of the two objects which are 
involved in it, is built. 

The Algorithm. It is an iterative algorithm which starts with an empty list 
of built objects and an empty list of active constraints. The initial degree of 
freedom of each object is computed (3 for points and planes, 4 for lines). The 
algorithm stops when all the objects are built. Here are its main steps: 

1. Search one unbuilt object with minimal degree of freedom. 

2. Compute the rewriting rules for it, using its constraints with the previous 
objects. 

3. Build it, that is add it to the list of constructed objects. 

4. Compute the new set of active constraints. 

5. Compute the new degrees of freedom of all the unbuilt objects. 

We discuss now the problems to be solved during these steps. 

How to Choose the Object to Be Built. In step (I), we have to choose 
one object of minimal degree of freedom. In many cases, we will have many 
choices and we choose the object among the objects of smallest dimension. If this 
refinement is not sufficient, we choose the object, which induces the maximum 
number of active constraints. If there still remains several candidates, we choose 
any of them. 
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Constructing Sparse Reductor Sets. A main obstacle of the provers de- 
scribed in Section 3 is their inherent complexity, due to the quick growth of 
the size of the polynomials, with the number of geometric objects. In the appli- 
cations that we consider, the number of objects (e.g. about 50 points, lines or 
planes in the scene “place des arcades” ) is much larger than in classical theorems 
(see [3,5]), although the geometric constraints may be simpler. It implies that in 
characteristic set methods, we deal with many variables and since the polynomi- 
als are interreduced, the output polynomials of the triangular sets will involve 
all these variables and will thus be huge. The modification that we propose to 
such methods consists in replacing dense characteristic sets by sparse triangular 
sets of polynomials, which also allow to reduce every valid geometric property 
to zero. These sets will be called hereafter sparse reductor sets. 

Our modification is specific to geometrical problems, where object configu- 
rations are described by a few kind of geometric constraints (collinearity, copla- 
narity, orthogonality, cocyclicity . . . ) . If a point X is described by such a set of 
hypotheses, the possible configurations for X belong to a finite number of classes, 
going from the generic intersection case to a finite number of degenerate cases. 
For instance, if X is on two projective lines, either these lines are intersecting 
properly and X is their intersection or they are identical and X is any point 
on the line. Once this so called geometric type is known, we can easily derive a 
reductor set describing the position of X. As the position of X depends explicitly 
on few previous points (those which actually appear in the hypotheses on A), 
this yields a reductor set which is sparse. Let us illustrate now this approach, 
which has already been used in [15] for computations in the exterior algebra. 

We consider here an approach using a reference frame, in which we express 
the hypotheses by polynomials 



where Hi are polynomials in the coordinates of the objects, X is the point to 
be built and Ai, . . . ,Uk are the built points. As the geometric constraints on X 
involve specific and simple polynomials in the coordinates of X (usually linear or 
quadratic), the possible geometric type of intersection can effectively be analyzed 
by checking the vanishing (or non-vanishing) of certain resultant polynomials 
in the coordinates of the previous points. For instance, if a point X is in the 
intersection of the linear spaces A = (Ai , . . . , Ak^ ) and B = {Bi , . . . , Bk^ ) The 
intersection of the linear spaces can be defined formally as the last non-zero term 
of the expansion: 



(see [15] for more details). 

Checking the vanishing (or non- vanishing) of these resultant polynomials can 
be performed by using the reductor set associated with these previous points. 




( 1 ) 



D{A, B) = Dk, (A, R) © • • • © Dk,+k2-u{A, B). 
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It is also possible to do it by sampling techniques (choosing random values for 
the free parameters and computing the other). This is the method that we have 
used to construct a sparse redactor set for the scene “place des arcades” , because 
using directly characteristic sets was intractable in this case. In our application, 
we may even skip this checking step, and use external informations, that we 
known from the initial model. 



How to Avoid to the Redundant Constraints. During the update of the 
degrees of freedom in step (5), we have to look to all the active constraints on 
each unbuilt object and to compute the degree of freedom induced by these con- 
straints. This degree of freedom can be computed by reduction of the constraints 
on the new point by a triangular set (in order to remove redundant hypotheses). 
In practise, we do not use this technique because it requires too much memory 
and time resources. We prefer to use a random instantiation of the degrees of the 
parameters defining each built object (its degree of freedom). Once the previous 
points are instantiated, the constraints defining the object to be built defines a 
polynomial system in 3 or 4 variables (3 for points and planes, 4 for lines). The 
computation of a triangular set for such a system is easy and fast. This yields 
the degree of freedom of (or the dimension of the variety described by) the new 
point. If this leads to an impossible construction (an empty variety), we stop the 
computation, backtrack to the last choice step (1) and we choose another point. 



Backtracking. The refinements of the choice described in step (1) are based 
on heuristics and but may lead to a degenerated component. Thus during the 
algorithm, if we detect that one constraint is not satisfied or that we are not in 
a generic component (e.g. two objects are the same), we backtrack to step (1). 
For that, we maintain a list of possible choices at each step of the algorithm and 
process this list until a non-degenerate component is built. If this is not possible, 
we stop and output “failure” . 

Example 1 We consider the following hypotheses, defining a subconfiguration 
of “place des arcades” (Fig. 5): 
seqCpoint (p . i) , i=l . . 4) : 
seqCline (l.i),i=1..2): 
map(x->plane(x) , ) : 

mapCOnObject , [p2,p3] ,H1) ; 
mapCOnObject, [pl,p2,p3] ,H2) : 
mapCOnObject, [pl,p2] ,H11) ; 

0n0bject(p4,H3) ; 
mapCOnObject, [pl,p2] ,11) : 
mapCOnObject, [p2,p3] ,12) : 

PerpendicularConstraint Cll , 12) ; 

PerpendicularConstraint C hi ,H2) ; 

PerpendicularConstraint CH2,H3) ; 
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PerpendicularConstraint (Hll ,H3) : 

If we execute the algorithm on this scene, the following order is obtained: 

[P2, lu Pi. h. P3, H2, i/i]. 

At this stage, it remains to be built p4, et Hu. The degrees of freedom 
and the dimensions are the same for Hu and H^] H^ activates two constraints 
whereas Hu activates that only one constraint. Thus the algorithm chooses the 
plane H3, adds [H3, i/n] to the list “component” . This choice leads us to a degen- 
erated component of the configuration (in this case Hu is equal with 7?2)- Thus, 
the algorithm backtracks, deletes H3 of the list [H3,Hu], and constructs Hu, 
and so on. The constructive order found in this example is 

[P2, h, Pi, h, P3, H2, Hi, Hll, H'i,pi\. 



4.2 Theorem Discovering 

Up to recently, theorem provers were used only to check the conclusion of a 
theorem. However, in our context as explained in Section 2.1, the theorems are 
not inputs of the problem and have to searched in the geometric configurations, 
in order to improve the modelisation process. This task can be performed by 
trial and error, using a geometric prover but it is tedious and requires geometric 
intuition. We are interested in a more systematic approach. 

From an algebraic point of view, any polynomial of the ideal associated with 
the generic configurations is a theorem. If we are looking for relations between 
specific points, we intersect this ideal with the subring of coordinates of these 
points. This is the approach used in [23], the elimination process being carried 
on by Grobner basis computations. 

Our approach proceeds differently: a (sparse) redactor set of the scene is 
computed and used to recover the theorems. It is based on the following simple 
remark: 

Remark 2 The theorems of a configuration correspond to homogeneous invari- 
ant relations of small degree in the bracket algebra. 



The Principle. We begin by introducing some notations in the bracket algebra: 
Let S' be a subset of E, a letter of the bracket algebra on S is either an element 
of S or the linear form 7i defining the hyperplane at infinity, or the bilinear 
form Q defining the metric of the Euclidean space. 

Definition 3. We denote by support of a monomial of the bracket algebra 
on S, the set 0/ letters (counted with their multiplicity) appearing in the mono- 
mial. 

The support of H{A) Ti.{C) Q{A, B) \ A, B, C| \A, D, E\ is 

Supp{n{A) n{C) Q{A, B) |A, B, C\\A, D,E\)= Q, B, C\ D, E. 

We denote by homogeneous component the set of all the monomials with 
the same support. 



220 Didier Bondyfalat et al. 



According to Remark 2, the theorems correspond to relations between mono- 
mials of a same homogeneous component. We thus construct all the monomials 
of a same homogeneous component up to a given degree and consider a linear 
combination of these monomials with indeterminate coefficients. This generic 
polynomial is then reduced by the reductor set either using a reference frame or 
invariants and the remaining system is solved in order to determine the coeffi- 
cients which yield the properties of the configuration. 

Algorithm 4 For a geometrical configuration C: 

1. Choose a reference frame and compute a reductor set RS for the hy- 
potheses ofC. 

2. For each homogeneous component in the bracket algebra, ofC: 

— compute a generic polynomial V in the formal vector parameter u = 
[ui, ■ ■ ■ ,Uk] (where k is the number of monomials in the homogeneous 
component). 

— compute the remainder R of the reduction of P by RS. 

— compute the vectors u such that R = 0 and find their geometric inter- 
pretation. 

Notice that the last step reduces to solving a linear system in u, for the polyno- 
mial P is linear in u and the reduction does not change this property. Moreover, 
theorems correspond to integer solutions of this system. The task of interpre- 
tation of the geometric relations is based on a list of binomial relations corre- 
sponding to basic geometrical properties. We do not have yet a systematic way 
to do it. This will be discussed further in the conclusion. Note however that 
having such an interpretation is not necessary for the application described in 
Section 2.1. 

5 Examples 

This section details two complete examples of the techniques just described. 
These experimentations were performed mostly in maple. The graphs were ob- 
tained with the tool dot. The wire-frame models of Fig. 7 were obtained with 
the tool TOTALCALIB of the Robot vis project. 

5.1 A Complete Example: “La place des arcades” Valbonne Prance 

In this section, we detail the case of the parameterization of the 3D scene depicted 
in figure (Fig. 3). 



Geometrical Constraints and Results after Processing. From the image 
sequence of Fig. 3, we can infer basic relations between the geometric objects: 
some frontages are parallel or perpendicular, the roofs of the building are rect- 
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angular . . . Below are written the constraints that have been used (maple file). 
See Fig. 5 for the notations. 



p8 




## Definition of the objects in the model 
seqCpoint (p . i) , i=l . . 23) , 

map(x->plane(x) , [H1,H2,H3,H4,H5,H6,H10,H11] ) , 
seqCline (l.i) ,i=l. .17) : 



## Incidence constraints: line, plane 
OnObjectCll ,H2) , 

OnObjectCll ,H11) , 

0n0bject(12,Hl) , 

OnOb j ect (17 , H5) , 

OnOb j ect (18 , H3) , 

OnOb j ect (18 , H6) , 

0n0bject(112,H3) , 

0n0bject(112,H10) , 

0n0bject(lll,H10) , 

0n0bject(115,H3) , 

0n0bject(115,H5) : 



## Incidence constraints: point, plcine 
op (map (OnOb j ect , [pl,p2,p3,p4] ,H1)) , 
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op(map(0n0bject , [p3,p4,p5,p6,p7] ,H2)) , 
op(map(0n0bject , [p6,p7,pl0,pll,pl2] ,H3)) , 
op(map(0n0bject , [p6,p8,p9,pl0] ,H4)) , 
op(map(0n0bject , [p9,pl0,pll,pl2,pl3,pl4] ,H5)) , 
op(map(0n0bject , [pll,pl3,pl5,pl6] ,H6)) , 
op(map(0n0bject , [pll,pl2,pl6,pl7] ,H3)) , 
op(map(0n0bject , [pl6,pl7,pl8,pl9] ,H3)) , 
op(map(0n0bject , [pl6,pl9,p20,p23] ,H3)) , 
op(map(0n0bject , [p20,p21 ,p22,p23] ,H10) ) , 
op(map(0n0bject , [p2,p4,p6,pl0,pl4] : 

## Incidence constraints: point, line 
op(map(0n0bject , [p4,p6] ,11) ) , 
op(map(0n0bject , [p4,p3] ,12) ) , 
op (map (OnOb j ect , [p6 , p8] , 13) ) , 
op (map (OnOb j ect , [p6 , plO] , 14) ) , 
op (map (OnOb j ect , [p8 , p9] , 15) ) , 
op (map (OnOb j ect , [p9,pl0] ,16)) , 
op (map (OnOb j ect , [pll,pl3] ,17)) , 
op (map (OnOb j ect , [pll,pl6] ,18)) , 
op (map (OnOb j ect , [pl3,pl5] ,19)) , 
op (map (OnOb j ect , [pl5,pl6] ,110)) , 
op (map (OnOb j ect , [p20,p21] ,111)) , 
op (map (OnOb j ect, [p20,p23] ,112)) , 
op (map (OnOb j ect , [p21,p22] ,113)) , 
op (map (OnOb j ect , [p22,p23] ,114)) , 
op (map (OnOb j ect , [plO,pll ,pl2] ,115)) , 
op (map (OnOb j ect , [pl6,pl7,p23] ,116)) , 
op (map (OnOb j ect , [pl8,pl9,p20] ,117)) : 

## Constraints of parallelism: plane, plane 
ParallelConstraint (H5,H2) , 

ParallelConstraint (H4,H6) , 

ParallelConstraint (H6,H10) : 

## Constraints of parallelism: line, line 
ParallelConstraint (13, 16) , 

ParallelConstraint (14, 15) , 

ParallelConstraint (17, 110) , 

ParallelConstraint (18, 19) , 

ParallelConstraint (111 , 114) , 

ParallelConstraint (112 , 113) : 

## Constraints of orthogonality: plane, plane 
PerpendicularConstraint (Hll ,H3) , 
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PerpendicularConstraint (Hll ,H2) , 
PerpendicularConstraint (HI ,H2) , 
PerpendicularConstraint (H3,H2) : 



## Constraints of orthogonality: line, line 
PerpendicularConstraint (11 , 12) , 

PerpendicularConstraint (14,16) , 

PerpendicularConstraint (17,19) , 

PerpendicularConstraint (111 ,112) : 

Algorithm 1 is used to obtain a constructive order (using the function Ordre) 
which gives the following list of objects sorted in the increase order of construc- 
tion: 

[p6, 11, p4, 12, p3, H2, HI, Hll, H3, plO, H5, 14, 115, pll, p2, 
p7, pl2, pl4, 16, 13, p8, H4, 15, p9, H6, 18, pl3, 17, 19, pl6, 
110, pl5, HIO, 112, p20. 111, p23, 114, 116, p21, 113, p22, pl7, 
pl8, 117, pl9, pi, p5] . 

This constructive order can be represented by the dependence graph (Fig. 6) 
where each node is associated to one object (ellipse nodes represent points, 
rectangular nodes, planes and diamond nodes, lines). The color of node encode 
the degree of freedom of the object represented by the node (darkgrey for three, 
grey for two, lightgrey for one and white for zero). Each edge is associated to 
one geometrical constraint. The orientation of the edge shows the dependence 
between the nodes. 

Now, we can compute a triangular set of the polynomial equations of the 
geometrical constraints. The variable order is described below. The independent 
variables appear in the first list, and in the second, are the variables which are 
not free. 

[p6[l] ,p6[2] ,p6[3] ,11[1] ,11[2] ,p4[l] ,12[1] ,p3[l] ,pl0[l] ,pll[2] , 
p2[l] , p7[l] ,pl2[l] ,pl4[l] ,16[1] ,p8[l] ,pl3[l] ,pl6[l] ,H10[1] , 
p20[l] ,p23[l] , p21[l] ,pl7[l] ,pl8[l] ,pl8[2] ,pl9[l] ,pl[l] , 
pi [2] ,p5[l] ,p5[2]] 

[11 [3], 11 [4], p4[2], p4[3], 12 [2], 12 [3] , 12 [4], p3 [2] , p3 [3] , 
H2[l], H2[2], H2[3], HI [1] , HI [2] , HI [3] , Hll[l], Hll [2], Hll [3], 
H3[l], H3[2], H3[3], pl0[2], pl0[3], H5[l], H5 [2] , H5 [3] , 14 [1], 

14 [2], 14 [3], 14 [4], 115 [1], 115 [2], 115 [3], 115 [4], pll [2], 
pll [3], p2[2], p2[3], p7[2], p7 [3] , pl2[2], pl2[3], pl4[2], 
pl4[3], 16 [2], 16 [3], 16 [4], 13 [1], 13 [2] , 13 [3], 13 [4] , p8 [2] , 
p8[3], H4[l], H4[2], H4 [3] , 15 [1], 15 [2], 15 [3] , 15 [4], p9[l], 
p9[2], p9[3], H6[l], H6[2], H6 [3] , 18 [1], 18 [2] , 18 [3], 18 [4] , 
pl3[2], pl3[3], 17 [1], 17 [2], 17 [3], 17 [4] , 19 [1], 19 [2] , 

19 [3], 19 [4], pl6[2], pl6[3], 110 [1], 110 [2], 110 [3], 110 [4], 
pl5[l], pl5[2], pl5[3], HIO [2], HIO [3], 112 [1], 112 [2], 112 [3], 
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Fig. 6. Dependence graph of “place des arcades' 
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112 [4], p20[2], p20[3], 111[1], 111 [2], 111 [3], 111 [4], p23 [2] , 

p23[3], 114[1], 114[2], 114[3], 114[4], 116[1], 116[2], 116[3], 

116 [4], p21[2], p21[3], 113 [1], 113 [2], 113 [3], 113 [4], p22[l], 

p22[2], p22[3], pl7[2], pl7[3], pl8[3], 117 [1], 117 [2], 117 [3], 

117 [4], pl9[2], pl9[3], pi [3], p5 [3] ] 

Below, is the triangular set computed with respect the variable order described 
above. This triangular set has 131 polynomials. Here are the first and last poly- 
nomials of this set: 

[p6[3] 11 [3] - p6[l] + 11 [1] , 
p6[2] - 11 [2] - p6[3] 11 [4] , 

11 [4] p4[l] - 11 [4] 11 [1] - 11 [3] p4[2] + 11 [3] 11 [2], 
p4[2] - 11 [2] - p4[3] 11 [4] , 



H3[3] pl9[2] - H3[3] 117 [2] + 117 [4] pl9[l] H3[l] 

+ 117 [4] pl9[2] H3[2] + 117 [4], 
pl9[l] H3[l] + pl9[2] H3[2] + pl9[3] H3 [3] + 1, 
pl[l] Hl[l] + pi [2] HI [2] + pi [3] HI [3] + 1, 
p5[l] H2[l] + p5[2] H2[2] + p5 [3] H2 [3] + 1]. 

The corrected model after adjustment is shown in Fig. 7. 



5.2 Examples of Theorem Discovering 

We illustrate now the method proposed for discovering theorems in three dif- 
ferent contexts. The first context is related to characteristic sets, the second to 
direct rewriting rules on points and the last one to computations in the exterior 
algebra. 



Example 5 (The median configuration) Let ABC be a triangle 
and A',B\ C be the middle points of [BC], [AC], [AB], respectively. 

For this configuration of n = 6 points, we have (g) = 20 brackets and 5(g) -I- 
(g) =31 homogeneous components of degree 2. For instance choose one of these 
31 components: 



\A,B,C\\J^,B',C'\, \A,B,^\\C,B',C'\, \A,B,B'\\C,^ ,C'\, 

\A,B,C'\\C,A',B'\,\A,C, A'\\B,B',C'\,\A,C,B'\\B,A',C'\,\A,C, C'\\B,A',B'\, 
\A,A',B'\\B,C,C'\, \A,A',C'\\B,C,B'\, \A,B' ,C'\\B,C,A'\ . 

We form the generic polynomial 

P = uM.B,C\\A',B',C'\ + ■ ■ ■ + uw\A,B',C'\\B,C,A'\ , 
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Fig. 8. The median configuration 



and compute a redactor set RS using geometric rewriting rules: 

B'^^{A + C), 
a^^{A + B). 
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Then, we compute the reduction of P by RS: 



R = \A, B, Cp ( — M3 + M2 — Mg + 2 mi — M5 + Mg — M7) . 



We solve the diophantine equation (linear system over Z): 



— For the solution M5 = 1, Mg = —1, Mi = Mg = M3 = M4 = Mg = M7 = Mg = 
Mio = 0, we obtain this bracket equation: 

\A,C,A'\\B,B',C'\ - \A,A',C'\\B,C,B'\ = 0 . 

We can recognize the property the lines {AA'), {BB'), (CC) are concurrent. 



— Another solution is Mg = M5 = 1, mi = M 3 = M4 = Mg = M7 = Mg = Mg = 
Mio = 0 . Thus, we obtain this bracket equation: 

\A,A\B\\B\C',C\ + \A,A\C\\B\C\B\ = 0 , 

which means that the lines (B'C) and (BC) are parallel. 

The proof: A is the meet point of {AA') and (BC). I is the meet point of 
{B'C) and {BC), thus we have: 



{A',I-B,C) 



\A,A',B\\B',C,C\ 
\A,A',C\\B',C,B\ ’ 



which by hypothesis is equal to —1. So the points B, C, A' , I are in harmonic 
division. But, the fact that A' is the middle point of [BC] implies that I is 
at infinity. Consequently, {BC) and {B'C) are parallel. 



Example 6 (Configuration of a rectangle) Let ABCD a parallelogram 
such as AC = BD. Choosing a reference frame, A(0,0), i?(xi,0), C{x 2 ,x^), 
D{x/i,x^), the hypotheses are: 



{AB) II {CD) -xi{x2 - X4) = 0 , 

{AD) II {BC) x/iXz — XiXi — X5X2 = 0 , 

AC = BD X2 + x^ — x^ — {—X5 + xi)^ = 0 . 

We compute a characteristic set CS: 

xfx2{xi - X3) , 

Xi(xg - X4) , 

Xsxfx2 ■ 
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We add the points of the absolute conics I and J, so we have 6 points and 20 
brackets and 31 homogeneous components. We choose the homogeneous compo- 
nent of support: B, D, /, J. 

P = ui\A, B, D\\A, /, J| + u2\A, B, I\ I^, D, J\ + us\A, B, J\\A, D, I\ . 

Then, we compute the pseudo-remainder R of P hy CS: 



R = xfxl {2ui +U2 — U3), 
and solve the diophantine equation: 



2ui + U2 — U3 = 0 . 

One solution is ui = 0 and U2 = M3 = 1, 

\A,B,I\\A,D,J\ + \A,B,J\\ADJ\ = 0. 

This equation says that the lines (AB) and (AD) are perpendicular. 

Example 7 (Configuration of “place des arcades”) We want to find the 
properties involving orthogonality relations in the sub-configuration of Fig. 9. 





Fig. 9. A sub-configuration of ’’place des arcades” 



First, we defined a constructive order using the method described in Sec- 
tion 4.1. Secondly, we construct a reductor set of the configuration, based on 
rewriting rule in the exterior algebra. We have used here the dual operator *, 
which computes the normal to a plane. We assume here that the orthogonality 
defined by duality is a same that the orthogonality defined by the quadric Q. 
We obtain the following rewriting rules: 
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h - 


Pa 7\pe 


Pa - 


-> H 2 V k 


h - 


{k V 7 Y) Apg 


Hi- 


Pe A pg A pio 


ts - 


pg A Pio 


P9 - 


X 5 Pio + Xe P12 -1- (1 - X 5 - xe){{Ui A U 2 ) V Hg) 


Pll - 


xaPio + (1 - X 3 ){lr V H) 


17- 


-^He\J He 


Hi - 


Pa A Pi A {H2)* 


Pa - 


X2Pi + {lx2){h V H) 


h - 


{{hyn)*yn) Api 


P4 - 


xipe + {1 - xi){h V Ti) 


h - 


-^H2V Ha 


k- 


He V Ha 


Pio - 


Ha V He V He 



i?5 ^ (i?2 V Ti) ^Pl2 
H 2 —>■ P6^P7 ^ {Hq)* 
He ^ Pe Apr A pi2 . 



In this case, the classical techniques using coordinates failed due to the compu- 
tational memory resource. 

Then, we compute the different homogeneous components involving 
and concentrate on the homogeneous component in the points p 4 , pe, Pe, P 9 - In 
this case, we have only one generic component 

uiUpa a Pe) V Ti)* A ((ps A pg) V TC) + tt2((p4 A pg) V Ti)* A {{pe Apg) V Ti) 
+U 3 {{p 4 A pg) V H)* A {{pe A pg) V H) + u^dpe A pg) V H)* A ((p4 A pg) V H) 
+uei{p6 A pg) V H)* A ((p 4 A pg) \/ H) + ue{{pa A pg) V H)* A ((p 4 A pe) V H) . 

and check that the only valid relation is for itg = • • • = ug = 0, that is (p 4 ,pe) 
orthogonal to (pg , pg ) . 

6 Conclusion 

This paper describes an application of theorem proving to 3D modelisation prob- 
lems in computer vision. We combine the construction of a reductor set with the 
analysis of the graph of constraints, in order to compute a correct parameteri- 
zation of our 3D model. This parameterization is then used in a minimization 
process to fit the parameters defining the model with its images in the cam- 
era(s). We propose a new technique for sorting points and constructing sparse 
reductor sets. This method uses a coordinate system and mixes symbolic and 
numeric computations. This enables us to treat systems that were intractable by 
Grobner basis or characteristic set methods, but many improvements may still 
be considered. 
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We also focus on discovering theorems, which can be used in the numeri- 
cal steps in the 3D reconstruction, described in Section 2.1. Our approach uses 
the bracket algebra formalism, which has the advantage to deal with invariant 
quantities (like determinants of planar points) which can be useful for direct 
computations from the images. This formalism also allows the decomposition of 
algebraic expressions into homogeneous components that are used to find the 
geometric properties of the configuration. This task can be easily parallelized, 
for the computation in each component is independent of the others once the 
reductor set is available. Indeed, in practice, we are running several reduction 
processes (one for each homogeneous component) in parallel on different work- 
stations. This homogeneity also permits to restrict our search to a subset of 
points of the configuration or to special classes of relations (projective, affine. 
Euclidean). A drawback of the approach is the fast growth of the number of 
components with the number of geometric features and the degree of the mono- 
mials. Fortunately, interesting properties in a figure correspond to relations of 
small degree. An interesting feature of our approach is that reductor sets are 
computed only once. 

Among the points that need improvements, we would like to avoid as much 
as possible the backtracking in the construction of the order on the points. One 
solution would be e.g. to add new and trival connections in the graph that can 
be inferred directly from the geometric hypotheses. Interpretation of relations 
obtained with Algorithm 4 in terms of geometric properties may be automatised 
using Cayley factorization techniques [24]. As a final remark, the numerical qual- 
ity of the parameterization of the model also needs a better understanding and 
deeper analysis, as it is used in a numerical optimization process. 
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Abstract. This paper reviews three main techniques for automated ge- 
ometry diagram construction: synthetic methods, numerical computation 
methods, and symbolic computation methods. We also show how to use 
these techniques in parametric mechanical CAD, linkage design, com- 
puter vision, dynamic geometry, and CAI (computer aided instruction). 
The methods and the applications reviewed in this paper are closely 
connected and could be appropriately named as engineering geometry. 



1 Introduction 

In the area of automated geometry reasoning, most of the efforts have been con- 
centrated on automated geometry theorem proving and discovering. These efforts 
lead to the classic work of Tarski [61], Gelernter [29], and Wu [68]. On the other 
hand, automated geometry diagram construction (AGDG) is more or less over- 
looked. Actually, AGDG has been studied in the GAD community under a differ- 
ent name: geometric constraint solving (GGS) and with a different perspective: 
engineering diagram drawing. GGS is the central topic in much of the current 
work of developing intelligent or parametric GAD systems [2,5,21,33,43,53,59,63] 
and interactive constraint-based graphic systems [3,32,60]. The main advantage 
of using the constraint approach is that the resulting systems accept declar- 
ative descriptions of diagrams or engineering drawings, while for conventional 
GAD systems the users need to specify how to draw the diagrams. As a result, 
parametric systems are more powerful and user friendly. 

Three main approaches have been developed for automated diagram con- 
struction: the synthetic approach, the numerical approach, and the symbolic 
approach. 

In the synthetic approach, a pre-treatment is carried out to transform the 
constraint problem into a step-by-step constructive form which is easy to draw. 
A majority of the work is to transform the problem to a constructive form that 
can be drawn with ruler and compass. Once a constraint diagram is transformed 
into constructive form, all its solutions can be computed efficiently. Two main 
techniques used in the synthetic approach are the rule-hased search and the graph 

* This work was supported in part by an Outstanding Youth Grant from the Chinese 
NSF and the National “973” Project. 
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analysis. The basic idea of the rule-based search [2,5,21,59,63] is to represent the 
knowledge of geometric construction as deductive rules and use various search 
techniques from AI to find a step-by-step construction procedure for a con- 
strained diagram. The basic idea of the graph analysis method [33,46,43,53,62] 
is to represent the geometric conditions as a graph and use various techniques 
from graph theory to attack the problem. 

In the numerical approach, geometric constraints are translated into alge- 
braic equations and various numerical techniques are used to solve these equa- 
tions. The first generation constraint-based systems, such as Sketchpad [60] and 
ThingLab [3], used numerical relaxation as the last resort of solving constraint 
problems. Newton iteration method was used in Juno-2 [32] and in the Varia- 
tional Geometry [51]. The main advantage of numerical methods is their fast 
speed and generality. On the other hand, it also needs further improvements 
in several aspects: like finding stable solutions during animation and finding all 
solutions of equation systems. 

In the symbolic approach [42,22], we also transform the geometric constraints 
into algebraic equations. Instead of using numerical methods to solve the alge- 
braic equations directly, we first use general symbolic methods such as Wu-Ritt’s 
characteristic set method [68], the Grobner basis method [7], the Dixon resul- 
tant method [39], and Gollins’ cylindrical algebraic decomposition method [15] 
to change the equation set to new forms which are easy to solve, and then solve 
the new equations numerically. The advantage of the symbolic computational ap- 
proach is that it may provide complete methods for many problems in AGDG. 
On the other hand, current symbolic methods are still too slow for real time 
computation. 

The above AGDG techniques can be applied to a variety of engineering prob- 
lems: parametric mechanical design, linkage design, robotics, computer vision, 
and geometric modeling. They can also be used in dynamic geometry, physics, 
and GAI (computer aided instruction) . We call the methods and the applications 
reviewed in this paper engineering geometry. 

This paper is an attempt to give an overview of the research on engineering 
geometry with emphasis on the work done by the extended Wu group. The reader 
may find details for other related work based on the given references. In the next 
section, we introduce two synthetic approaches: the global propagation and the 
graph decomposition approach. In Section 3, we introduce the application of 
the numerical optimization methods to AGDG. In Section 4, we introduce the 
symbolic approach based on Wu-Ritt’s characteristic set method. In Section 5, 
we introduce dynamic geometry as a tool of generating diagrams. In Section 6, 
we discuss applications of these techniques. 

2 Synthetic Approaches to AGDC 

Synthetic approaches provide a geometric way of solving algebraic equations. 
They can generate all solutions of a diagram efficiently and with high precision. 
The solutions provided by this kind of approaches are stable and have geomet- 
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ric meanings. So in most drawing systems, synthetic approaches are first used 
to solve the diagram construction problem. If synthetic approaches fail, other 
approaches will be used. 

In this section, we will give a brief introduction to two synthetic approaches: 
the global propagation approach [21] and Owen’s graph decomposition 
approach [.53]. Other important synthetic approaches include Hoffmann’s graph 
analysis approach [33], Kramer’s freedom of degree analysis approach [43], Lath- 
eam and Middleditch’s approach based on weighted graphs [46], Lee’s optimized 
rule-based method [47], and rule-based methods proposed in [1,64]. 

2.1 The Global Propagation Method 

Local Propagation is a basic AI technique to solve constraint problems [57,49]. 
It is to find unknown quantities one by one from the set of known quantities. 
For instance, from the relation 5F — 9c = 160 about two kinds of measurement 
of temperature c and F, we may find the following propagational rules: 

1. If c is known, we may compute F with the formula: F' = |c-|- 32. 

2. If F is known, we may compute c with the formula: c = , 

For AGDC, local propagation is closely related to the method of locus intersec- 
tion. For instance, to draw a triangle ABC with sides AB = 4,BC = 5, and 
CA = 6, we first draw an arbitrary point as point A. To draw point B, we first 
draw a circle (locus) with center A and radius 4, and point B is an arbitrary 
point on this circle. To draw point C, we first draw two circles (loci) with cen- 
ters A and B and radii 6 and 5 respectively, and C is the intersection of these 
two circles. 

It is clear that the power of local propagation is limited. It can not solve con- 
straint problems with loops, i.e., more than one geometric objects are constrained 
or referred to simultaneously in the problem. Global propagation is proposed to 
solve constraint problems with loops [21]. Like the local propagation method, it 
tries to determine a geometric object with the locus intersection method. But 
the global propagation uses not only the constraints involving this object but 
also implicit information derived from other constraints. The global informa- 
tion needed in the propagation comes from a geometry information base (GIB) 
built before the construction begins. The GIB for a configuration is actually a 
database containing all the properties of the configuration that can be deduced 
by using a fixed set of geometric axioms [14]. 

Figure 1 is an example from [33]. Starting from point B, it is easy to construct 
B,C,X. Next, we want to construct D. Since \XD\ is known, we need only to 
know the direction of XD which can be determined by the following global 
propagation of line directions: XD _L DE, L{DEA) = a^, AE _L AB, and 
1{ABC) = oi. 

Generally speaking, to determine the direction of a line L, we first consider 
some basic situations: 

1. L passes through two known points. 
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The constraints: 

IBCI = dl,IEDI = d2, 

IXCI = d3, IXDI = d3 

ANGLE(ABC) = al, ANGLE(DEA) = a2 
PERP(X,C,C,B), PERP(X,D,D,E), PERP(B,A,A,E) 



Fig. 1. An example to show the propagation of line direction 



2. L cuts a fixed length between a pair of parallel lines. 

3. L cuts a pair of segments on the sides of a known angle such that the ratio 
of the segments is known. 

4. L cuts a pair of segments between three known lines passing through a point 
such that the ratio of the segments is known. 

Secondly, line directions can be propagated as follows: 

h Tl I2 T2 ... Ts-l Is, 

where li are lines and represent the fact that li and Zi+i form a known angle. 
If the direction of Ig is determined by one of the basic situations, the direction 
of line 1 1 is known by the above propagation. This process clearly demonstrates 
the global feature of the propagation. 

An important technique in the global propagation is to use LC transforma- 
tion. An LC transformation is a one-to-one map from the Euclidean plane to 
itself which transforms a line to a line and a circle to a circle. We mainly use 
two kinds of LC transformations. 

The first LC transformation is as follows. Suppose that segment AB has a 
fixed direction and a fixed length. Then point A is on a line or a circle if and 
only if point i? is on a line or a circle. An example is given in Fig. 2. Suppose 
that points A and C have been constructed. Next, we construct point B. Since 
LABC = oi, B is on a circle C\. Similarly, point D is on a circle €2- Since BD 
has a fixed direction and a fixed length, point B must be on another circle C3, 
which is the transformation of circle C2. i? is the intersection of circles C2 and C3. 
To construct C3, we first construct the center X\ of circle C2 and then move it 
for a distance of \BD\ along the direction of BD to obtain the center X2 of C3. 
The radius of C3 is the same as that of C2. 

The second LC transformation is related to ratio constraints. If points P, Q, R 
are collinear, PQ/RQ is known, and one of them, say Q, is already known, then 
point P is on a line or a circle if and only if point i? is on a line or a circle. 
Consider the constraint problem in Fig. 3. Suppose that we have constructed 
points A and B. Since 1 {ACB) = ai, point C is on a circle Ci. We construct 
the center X2 of circle Ci. Since M is the midpoint of BC, by the second LC 
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A 



Xi 



X2 



The constraints: 
lACI = dl, IBDI = d2 

ANGLE(ABC) = al, ANGLE(ADC) = a2 
ANGLE(BOC) = a3 



Fig. 2. An example to show the first kind of LC transformation 




c 






B 



The constraints: 

IABI = dl,IAMI = d2 
MID(M,B,C), ANGLE(ACB) = al 



Fig. 3. An example to show the second kind of LC transformation 

transformation, point M is on another circle C 2 . Hence M is the intersection of 
circle C 2 and the circle with A as center and with d ,2 as radius. 

On the top level, the method works as follows [21]: 

1. For a constraint problem 



let CT = {Cl,..., Cm} be the constraint set, CS = 0 the construction 
sequence, QS = (Qi, . . . , Qm} the points with given construction order, and 
PS = {P\, . . . , Pn} the remaining points. We assume that the problem is 
not over-constrained, i.e., we have jCTj < 2 * \PS\ — 3. 

2. Build the GIB as described in [14]. Then repeat the following steps first 
for QS and then for PS until both QS and PS become empty. 

3. Take a point P from QS or PS. For each constraint Ti G CT involving P, 
decide the locus Lc of the points satisfying Ti, assuming that all points 
constructed in CS are known. We then obtain a set of triplets 



Three cases are considered. (1) s = 0. Point P might be an arbitrarily chosen 
(free) point. We add a new construction C = (POINT, P) to CS. (2) There 
exist i ^ j such that Ti yf Tj and Lci and Lcj are not parallel lines or 




{{P,T^,Lc^),...,{P,T,,Lcs)}. 
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concentric circles.^ We add a new construction C = (INTERSECTION, P, 
Lci, Lcj) to CS and remove Ti and Tj from CT. (3) Otherwise, point P 
is a semi-free point: it can move freely on a line or a circle. We add a new 
construction C= (ON, P, Lci) to CS and remove Ti from CT. 

4. Now we check whether the remaining problem is over-constrained, that 
is, whether \CT\ > 2 * IPS'! (note this inequality is different from the one 
in Step 1). (1) If it is over-constrained then the construction sequence is 
invalid in this case. If P is from QS, the order given by the user can not 
be constructed and the process terminates. Otherwise, restore the removed 
constraints and repeat the preceding step for a new point from PS. (2) If 
it is not over-constrained, point P is constructed. We need to repeat the 
preceding step for a new point. 



2.2 Owen’s Graph Decomposition Method 

In the graph analysis approach, a constraint problem is first transformed into a 
graph whose vertices and edges represent the geometric objects and constraints 
respectively. A constraint problem without loops can be represented as trees [62]. 
Owen introduced a graph analysis method that can handle constraint problems 
with loops [53]. Basically speaking, Owen’s algorithm has two steps: 

1. Decompose the graph into sub-graphs. A graph is called bi-connected if it is 
connected and we cannot find a vertex such that by removing this vertex the 
graph becomes two graphs. A graph is called tri-connected if it is bi-connected 
and we cannot find two vertices such that by removing these two vertices 
and the edge (if there exists one) connecting them, this graph becomes two 
graphs. The decomposition procedure is first decomposing the graph into 
bi-connected graphs and then decomposing the bi-connected graph into tri- 
connected graphs by using algorithms from graph theory [30]. At the final 
stage, every sub-graph must has less than three vertices. Otherwise, the 
algorithm cannot handle the problem. 

2. Assemble the sub-graphs into larger ones. This step is the reverse of the first 
step and has two sub-cases for a well-constrained problem. 

(a) If two vertices are removed during the decomposition and both vertices 
are connected with the same unknown vertex V, then we can construct 
the geometric object represented by V in the next step. For instance, if 
the two known vertices represent lines and V represents a point P, then 
we will take the intersection of the two lines to obtain point P in the 
next step. 

(b) If a graph is decomposed into three sub-graphs in such a way that each 
pair of sub-graphs have a common vertex, then we can assemble the 
three sub-graphs together. Since the problem is well-constrained, each 
sub-graph represents a rigid part. Let the common vertices represent, 

^ We can check these conditions by using automated theorem proving methods such 
as Wu’s method [9,68] or the area method [13] which are quite fast. 
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say, three points. Since each pair of the points is in a rigid part, we 
can compute the distances between them. Then the assembling problem 
becomes the construction of a triangle with three known sides. 

Let us consider the problem in Fig. 1. Figure 4 (a) is the graph of this 
constraint problem, where /i, ^ 2 , ^ 3 , ^ 4 , ^ 5 , ^6 lines (not segments) AB, BC, 
CX, XD, DE, EA respectively. We will solve this problem in two steps. 




1. Decompose the graph into sub-graphs. We first decompose the graph into 
three sub-graphs by breaking the graph at vertices X, li, Iq. The sub- 
graph IiIqA needs no further decomposition, since it contains three vertices 
only. The other two sub-graphs are similar. Figure 4 (b) gives the decompo- 
sition for one of them. 




Fig. 5. Three sub-diagrams of the diagram in Fig. 1 



2. Assemble the sub-graphs into larger ones. The three sub-graphs obtained in 
the first step of decomposition represent the three diagrams in Fig. 5, which 
are easy to construct with ruler and compass. The critical step is to assemble 
the three parts into one. Let us note that the respective common vertices 
are X , li, and le- Since each of the three parts is a rigid body, we know the 
distances from X to h and Iq, and the angle formed by h and Iq. Then we can 
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easily draw a diagram containing X, l\ and The remaining constructions 
are easy. 

According to the above algorithm, we cannot solve the problem in Fig. 2. 

3 Numerical Computation Approaches to AGDC 

Numerical methods are usually used as the last resort in the constraint solving 
process, because they can be used to solve any kinds of equation systems. The 
most commonly-used method in the numerical approach is the Newton-Raphson 
method. It is fast, but has the instability problem: the method is sensitive to the 
initial values. A small deviation in the initial value may lead to an unexpected 
or unwanted solution, or to an iteration divergence. To overcome this difficulty, 
recently the homotopy method has been proposed and experimented with [45]. 
According to the report in [45], generally the homotopy method works much 
better in terms of stability. These two methods generally require the number of 
variables to be the same as the number of equations. 

In [28] , a method based on the optimization techniques for solving geometric 
constraint problems is proposed. Experiments with this method show that it is 
also quite stable. Furthermore, the method can naturally deal with under- and 
over-constrained problems. 

Generally, a geometric constraint problem can be first translated into a sys- 
tem of equations: 

..,Xn)=0, 
f2{xi,. ..,Xn)=0, 

fm{xi^ . . . , Xn) — 0 . 

The optimization approach solves the equation system by converting it into 
finding the X at which the sum of squares 

m 

a{X) = Y,MXf 

i=l 

is minimal, where X = (xi, X 2 , . . . , x„) is the set of variables. It is obvious that 
the equation system has a real solution X* if and only if min cr(A') is 0. The 
problem of solving a system of equations is thus converted into the problem of 
finding the minimum of a real multi-variate function. The problem now can be 
solved by various well-developed numerical optimization methods [52,17]. 

One obvious fact for this approach is that the number of equations m is not 
necessarily the same as the number of variables n. Thus for this approach, it 
is natural to deal with under- and over-constrained problems. Another advan- 
tage of this method is that it is not sensitive to the initial value. This feature 
is demonstrated in Fig. 6 where the three circles are specified to be mutually 
tangent and to be tangent to two neighboring sides of a triangle whose three 
vertices are specified to be fixed. Figures 6(a) and 6(c) are the initial diagrams 
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sketched by the user. It is easy to see the differences between the initial guesses 
from the exact solutions of this problem. Also, this figure demonstrates how 
different initial values lead to different branches of the solutions. 




(d) 

Fig. 6. The insensitivity to the inexact initial guesses of the 
solutions in the optimization method. 







Fig. 7. A diffi cult problem, (a) is the initial diagram drawn 
by the user, (b) is the diagram generated after all tangent 
constraints are added 



Figure 6 is the simplest case for the problem we call the tangent paeking 
problem. The problem is to pack n(n + l)/2 circles (n rows of circles) tangent 
to adjacent circles and/or the adjacent neighboring sides of a given triangle. 
Figure 7 is the case of n = 6, i.e., we need to pack 21 circles in the triangle. This 
difficult problem contains 174 variables (since we introduce auxiliary tangent 
points) and 6 linear equations and 168 quadratic equations which could not 
be block triangularized. Table 1 shows the running time for different n of this 
problem. 
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Table 1. Running statistics for the tangent packing problems 



# circles (# rows) 


# equations 


# variables 


time 


3 (2) 


30 


30 


0.228 


6 (3) 


54 


54 


0.965 


10 (4) 


86 


86 


3.379 


15 (5) 


126 


126 


11.587 


21 (6) 


174 


174 


23.751 



4 Symbolic Computation Approaches to AGDC 



Theoretically, both the synthetic and numerical approaches are not complete. 
Synthetic approaches are not decision procedures even for the case of ruler and 
compass construction, i.e., for many problems that can be drawn with ruler and 
compass, they cannot give a drawing procedure. Using numerical approaches, if a 
solution is not found, we cannot assert that the system does not have a solution. 
Also, if we find one or two solutions of a system, we do not know whether 
these are all the solutions of the system. To solve these kinds of completeness 
problems, we may use the symbolic computation techniques such as Wu-Ritt’s 
characteristic set method [68], the Grobner basis method [7], the Dixon resultant 
method [39], and Collins’ cylindrical algebraic decomposition method [15]. For 
instance, using Wu-Ritt’s characteristic set method [68] we can give decision 
procedures ([22]) 

1. to construct a constrained diagram with ruler and compass (rc-constructi- 
bility); 

2. to detect whether a constrained diagram is well-(under-, over-) constrained; 

3. to detect whether a set parameters are conflicting, and if they are, find the 
relation among them; and 

4. to detect whether a constraint is redundant. 



Let us consider how to draw a regular pentagon ABODE with edge r. Let 
A = (0,0), B = (r,0),C = (x 2 ,y 2 ),D = (x^,y^),E = (xA,yA)- Since there are 
five points in the diagram, we need 2 * 5 — 3 = 7 constraints. The constraint 
\AB\ = r is already satisfied automatically. We need six other constraints: 



\AB\ 

\AB\ 

\AB\ 

\AB\ 

\AD\ 

[AC] 



\AE\-. hi=yl + x\-r'^ = 0 
\BC\-. h 2 = y 2 + ^2 — 2rx2 = 0 

\CD\: h 3 = yl~ 2y2ys + a;§ - 2 x 2 X 3 + yj + xl - = 0 

\DE\: hi = yl~ 2ysyA + xl - 2 x 3 X 4 + yj + xj - = 0 

\DB\: = 2rx3 — = 0 

\CE\: he = yl -2y2y 4 + xl~ 2 x 2 X 4 = 0. 



Wu-Ritt’s characteristic set method [68] can be used to represent the zero set 
of any polynomial equation system as the union of the zero sets of polynomial sets 
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in triangular form. There are methods to find all solutions of a set of polynomial 
equations in triangular form. Let Zero{PS) be the set of all zeros or solutions of 
the polynomial equation system PS and Zero{PS/D) = Zero{PS) — Zero{D) 
for a polynomial D. Using Wu~Ritt’s characteristic set method to the pentagon 
problem, we have 

Zero{{hi, he}) = Ui^.i^Zero{TSJIi) 
where TSi and It are given below. 



TSi = 


TS2 = TSs = 


TSa = 


2/4 - 2/2 


ryA + {2x2 - 2r)y2 yA ~ 2/2 


2/4 


Xa-\- X2 — r 


2xa — 2x2 + x xa-\- X2 — r 


XA-\-r 


rys + {-2X2 + r)y2 


rye -|- {2x2 ~ 3r)?/2 rye -|- {2x2 ~ 3r)j/2 


2/3 - 2/2 


2x3 - r 


2xe — r 2xe — r 


2xe - r 


4y| — 2rx2 — 


4y| — 2tx2 — 4j/2 — 2rx2 — 


4 y 1 -\- 5r' 


4 x 2 ~ 6rx2 + 


4a;2 — &rx2 + 4 x 1 ~ ^rx2 + 


2x2 + r. 



h = I 2 = h = r and I a = 1. TSa does not have non-zero solutions, since it 
contains 4y| -|- 5r^. Each of TS\, TS 2 , and TS 3 contains six polynomials and 
has total degree four, i.e., for a non-zero value of r, each of TS\, TS 2 , and TS 3 
generally gives four solutions to the problem (see Fig. 8). 

4.1 Well-, Under-, and Over- Constrained Systems 

A constraint system is called 

— well-constrained if the shape of the corresponding diagram has only a finite 
number of cases; 

— under- constrained if the shape of the corresponding diagram has infinite 
solutions; 

— over-constrained if the corresponding diagram has no solution. 

These properties are quite difficult to detect, so people usually use another kind 
of definition. A constraint system is called 

— structurally well-constrained if each of its sub-diagram with n points and 
lines has 2 * n — 3 constraints; 

~ structurally under- constrained if some of its sub-diagrams with n points and 
lines have less than 2 * n — 3 constraints; 

— structurally over-constrained if some of its sub-diagrams with n points and 
lines have more than 2 * n — 3 constraints. 

It is an interesting problem to study what kinds of structurally well- 
constrained systems are well constrained. In [54], some results are given. But in 
my viewpoint, these kinds of results are more theoretical-oriented than practical- 
oriented. Actually we can find many examples to show that structurally well- 
and over- constrained diagram could be under-constrained and structurally well-, 
under-, and over- constrained diagram could be over-constrained. 
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We use the pentagon example to show that sometimes we need more than 
2n — 3 constraints to define a constraint system with n points. The following 
table gives the result of checking three more geometric conditions on the three 
diagrams defined hyTSi,TS 2 , and TS 3 by using the automated theorem proving 
methods given in [68]. 





TSi TS 2 TSs 


II II II 


[AC] 

\BE\ 

\EC\ 


true false false 
true true false 
true false true 




Fig. 8. Three components for the regular pentagon problem 



It is clear that the pentagon problem is well-defined. But the seven indepen- 
dent constraints do not properly define a regular pentagon. To define a regular 
pentagon, we need to add at least another constraint 

\AD\ = \AC\: hT = yl + xl~yl-xl. 

Now there is only one useful triangular set TS\ in the decomposition: 

Zero{{hi , . . . , h-j}) = Zero{TS\/ Ii) U Zero^TS^jlA)- 

But this constraint system is structurally over-constrained. Generally, we 
have [22] 

Theorem 1. Using Wu-Ritt’s characteristic set method, we may decide whether 
a constrained diagram is well-, over-, or under- constrained over the field of 
complex numbers. 

4.2 Independent and Conflicting Constraints 

A set of constraints is said to be independent if no one of them can be removed 
and the new constraint system still defines the same diagram. A set of con- 
straints is said to be eonflicting if no diagram in the Euclidean plane verifies this 
constraint system. 
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Let us first comment that in the general case there is no connection between 
the concepts of independency and the critical number 2n — 3. Actually, there are 
conflicting and/or non-independent systems with less than 2n — 3 constraints, 
as well as non-conflicting and/or independent systems with more than 2n — 3 
constraints. 

For a constraint system CS, let PS be the corresponding algebraic equations 
of the constraints. The conflictness of CS means that PS does not have solu- 
tions. In the case of complex number field, this can be decided with Wu-Ritt’s 
characteristic set method [68,22]. For the case of real numbers, this can be done 
with the quantifier elimination algorithm [15]. For instance, if we want to draw 
a regular pentagon with edge 1 and diagonal 2, we will get a conflicting con- 
straint system. In other words, Zero{PS) = 0 where PS consists of equations 
for \AB\ = \BC\ = \CD\ = \DE\ = \EA\ = 1 and \AD\ = \DB\ = \BE\ = 
\EC\ = 2. 

If two dimensional constraints are conflicting, we can find the relation be- 
tween the two dimensions. In [42], this is done with the Grobner basis. In [11,22], 
both the Grobner basis and Wu-Ritt’s characteristic set methods are used to find 
such relations. In the regular pentagon problem, we just showed that the edge r 
and the diagonal d are two conflicting dimensions. To find a relation between 
them, let PS be the set of equations for the constraints \AB\ = \BC\ = \CD\ = 
\DE\ = \EA\ = r and \AD\ = \DB\ = \BE\ = \EC\ = d. Using Wu-Ritt’s 
characteristic set method, two such relations are found: 

d^ + rd — r'^ = 0 and d^ — rd — = 0. 

Since d and r are positive, the two reasonable solutions are d = and 

d = The first one represents the case where r is the edge and d is the 

diagonal. The second one represents the case where r is the diagonal and d is 
the edge. 

Let PS be the equation set of a constraint system. This system is not inde- 
pendent iff there is a polynomial P in PS such that 

Zero{PS — {P}) C Zero{P) or Zero{PS — {P}) = Zero{PS). (4.1) 

Note that to check (4.1) is actually to prove a geometry theorem. Let Cp be 
the geometric constraint of P = 0 and Ci , . . . , the geometric constraints 
for other equations in PS. Then (4.1) is equivalent to the following geometry 
theorem problem 

V points [(Cl A C 2 ■ • • A C„) Cp]. 

This kind of automated geometry theorem proving problems has been studies 
extensively [68,12,38]. We may use these efficient algorithms to prove whether 
(4.1) is true. 

4.3 A Decision Procedure for Rc-Constructibility 

Let Q be the field of rational numbers. We allow ourselves to abuse terms by 
saying that the numerical solutions of a diagram can be constructed with ruler 
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and compass if the corresponding diagram can be constructed with ruler and 
compass. It is well known [37] that a number rj can be constructed with ruler 
and compass iff there exist n real numbers 771 , 772 , . . . , and n quadratic 

equations 

Qi{xi) = x\ + hixi + Cl = 0 
Q2{X\X2) = xl + b2{xi)x2 + C2{xi) = 0 

Qn{xi,X2, . . . , X„) = + 6„(xi, . . . , Xn-l)Xn + C„(xi, . . . , X„_i) = 0 

such that Qi{rji,r] 2 , ■ ■ ■ Trji) = 0, = 1, . . . , n). Since Qi is of degree two, we 

can further assume that the bi and Ci are linear in the variables 
From the above equations, we have the well-known result: An rc-constructible 
number must be the root of an equation of degree 2^ for some k > 0. Thus 
the root of a cubic polynomial equation is rc-constructible iff the polynomial is 
reducible. Since we can efficiently factor polynomials with computer algebraic 
systems such as Maple [8] , the only remaining problem is to decide whether the 
roots of polynomial equations of degree 2^ are rc-constructible. 

Let us consider the simplest case: Decide whether the roots of a quartic 
equation 

x^ h^x^ h2X^ hix -I- ho = 0 (4.2) 

are rc-constructible. Let us assume that the quartic polynomial in (4.2) is irre- 
ducible. If a root y of (4.2) is rc-constructible, y can be written as the solution 
of the following two equations: 

Xi bxi -I- c = 0 
+ {-mxi -I- f)y - nxi g = 0. 

Without loss of generality, the first equation can be reduced to — 1 = 0. From 
the second equation, we have Xi = ^ my+n^ ■ Substituting this into xf — 1 = 0, 
the numerator is (y^ + /y + y)^ ~ (wy -\-nff = 0. Comparing the coefficients of y 
in this equation with that of (4.2), we obtain a set of equations 

2/ - ha = 0 

— 2g — f^ h 2 = 0 
2mn — 2fg hi = 0 
-I- ho = 0. 

Eliminating /, m, and n, we have 

8y^ — 4h2y^ -I- (2hiha — 8ho)y — hohg -|- 4hoh2 — hj = 0. (4.3) 

Let yi be a root of equation (4.3), /i, mi and ni solutions for /, m and n with g 
being substituted by yi. Then all the roots of equation (4.2) can be obtained 
from the quadratic equations 



+ fiy + yi ± {miy -k ni) = 0. 
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Therefore, the roots of equation (4.2) can be obtained by successively solving 
several quadratic and a cubic equations. Roots of a cubic equation are rc- 
constructible iff the cubic polynomial is reducible, i.e., it has a linear factor. 
Then we have a method of deciding whether the roots of a quartic equation can 
be constructed with ruler and compass. For the solution to the general case, 
please consult [22]. 

Let us consider Pappus’ Problem: Given a fixed point P, a fixed angle AOB, 
and a fixed length d, draw a line passing through point P and cutting a seg- 
ment of length d between lines OA and OB. Let the segment be XY . Use 
the following coordinates: O = (0,0),^ = (w,0),P = {u,v),B = {w,s),X = 
{xi,yi),Y = (a;2,0). If we consider w,u,v, and s as non-zero constants, with 
Wu-Ritt’s method, we can find a triangular set which contains the following 
quartic equation / = + w^)xf — 2ws'^{vs + wu)xi + (v'^ + u^ — d'^)w'^s'^xl + 

2d?"w^vsxi — d^w^v^ = 0. With the method mentioned above, it is easy to check 
that this diagram generally cannot be drawn with ruler and compass. With the 
above method, we can also prove the following interesting fact: if point P is on 
the bisector of angle AOB then the diagram is rc-constructible. 

5 Dynamic Geometry 

Dynamic geometry is originated in the field of CAI (computer aided instruc- 
tion) [20,36], although it can be used in other areas such as linkage design [26] 
and computer vision [27]. Many geometry theorem provers also have diagram 
generation as an assistant part, e.g., [9,66]. But these graphic editors are for 
constructive geometry statements. In this paper, we use dynamic geometry as 
an effective tool of generating geometric diagrams in GCS problems such as 
linkage design, computer vision, and GAL 

By dynamic geometry, we mean models built by computer software that 
can be changed dynamically [41]. The basic properties of dynamic models are: 
dynamic transformation, dynamic measurement, free dragging, animation, and 
locus generation [25,26]. By doing dynamic transformation and free dragging, we 
can obtain various forms of diagrams easily and see the changing process vividly. 
Through animation, the user may observe the generation process of curves or the 
figures of functions and get an intuitive idea of the properties of the functions 
and curves. Furthermore, we can combine free dragging with locus generation to 
give a powerful tool of showing the properties of curves or functions. 

Geometry Expert ( GEX) [24] is a program for dynamic diagram drawing and 
automated geometry theorem proving and discovering. As a dynamic geometry 
system, GEX can be used to build dynamic visual models to assist teaching and 
learning of various mathematical concepts. As an automated reasoning system, 
we can build dynamic logic models which can do reasoning themselves. Logic 
models can be used for more intelligent educational tasks, such as automated 
generation of test problems, automated evaluation of students’ answers, intelli- 
gent tutoring, etc. 
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With Geometry Expert, we can build the following classes of dynamic visual 
models. 

1. Loci generated by diagrams constructed by using ruler and compass. This 
class includes conics, trigonometric functions and the following curves built 
with Homer’s construction [25]: 

(a) Rational functions y = 

(b) Functions of the form y = ^f{x). 

(c) Diagrams for any rational plane curves defined by their parametric equa- 
tions 

P{t) Q(t) 

"" H(t)’^ R{t) 

where P, Q, R are polynomials in t. 

(d) Interpolation polynomials. For n distinct points {xi,yi),i = 1, . . . , n, 
draw the diagram of a polynomial y = f{x) such that yi = f{xi). 

2. Diagrams of functions using numerical computation. This class includes func- 
tions of the form: y = f{x) where f{x) could be any “elementary functions”- 
a’”, x°“, log{x), trigonometric functions - and their arithmetic expressions and 
compositions. This part is quite similar to a “Graphic Galculator,” but is 
more flexible and powerful. 

3. Loci generated by linkages with rotating joints alone. As proved by 
Kempe [40], this class includes any algebraic curve f(x, y) = 0 where f(x, y) 
is a polynomial in x and y. 

To demonstrate how to use free dragging and locus generation, we consider 
the following problem. Let H be the orthocenter of triangle ABC. We fix points A 
and B and let point C move on a circle c. We want to know the shape of the 
locus of point H. Let (xo^yo) and r be the center and radius of circle c, and 
A = (0,0), and B = (d, 0). With the method in Section 6.1, we can derive the 
equation of this locus 

{{x - xq)"^ + yo~ r'^)y'^ + 2yo(a; - d)xy + {x - (£fx^ = 0. 

But from this equation, it is still difficult to know the shape of this curve. With 
dynamic geometry software like Geometry Expert^ we can continuously change 
the radius of circle c by free dragging and observe the shape of the curve changing 
continuously (Fig. 10). 

6 Applications: Engineering Geometry 

We mentioned before that AGDG is first considered in the community of mechan- 
ical GAD. Here we want to show that by considering AGDG in a more general 
sense, it can be used to more engineering problems such as linkage design, com- 
puter vision, robotics [71,35,73] and geometric modeling [72,23]. AGDG methods 
can also be used to solve problems from mathematics, physics, and education. 
We call the methods of AGDG and these applications engineering geometry. 
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Fig. 9. Shapes of the locus of the orthocenter when the radius of circle c changes. 



6.1 Mechanical Formula Derivation 

In geometry, the techniques for AGDC have close connection with the so-called 
mechanical formula derivation. There are two kinds of formula derivation prob- 
lems. One is deriving geometry formulas [69,10,11,65]; the other is finding locus 
equations [67,65]. The problem of finding the relation between the edge and the 
diagonal of a regular pentagon in Section 4.2 is an example of finding geometry 
formulas. In general, all these problems can be considered as finding the manifold 
solutions of equation systems of positive dimension [70]. 

For a geometric configuration given by a set of polynomial equations 

. . . , n,, xi, . . . , Xp) = 0, . . . , hr(ui , . . . , Mg, Si, . . . , Xp) = 0 

and a set of inequations 



{D = di{u,x) yf 0, . . ,,ds{u,x) 0}, 

we want to find a relation (formula) between arbitrarily chosen 
variables ui,...,Uq (parameters) and a dependent variable, say, Xi. Methods 
based on the Grobner basis [11] and the characteristic set [69,11,65] are given. 

An interesting example is Peaucellier’s Linkage (Fig. 9). Links AD, AB, DC 
and BC have equal length, as do links EA and EC. The length of ED equals 
the distance from E to F. The locations of joints E and F are fixed points on 
the plane, but the linkage is allowed to rotate about these points. As it does, 
what is the locus of the joint B1 

Let F = (0,0), E = (r, 0), C = {x2,y2), D = {xi,yi), and B = (x,y), n 
and TO be the lengths of the projections of AD on the direction of line EDB 
and on the direction perpendicular to it when E, F and D are collinear. Then 
the geometry conditions can be expressed by the following set of polynomial 
equations H 

hi = yf + Xi — r"^ = 0 r = ED 

^2 = 2 / 1 “ + X2~ 2xiX 2 + y1 + x\ — = Q CD"^ = -n? + 

^3 = 2 / 1 “ 2 j / j /2 + 2^2 — 2xX2 + x'^ + y"^ — = Q CB"^ = -n? + 
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Fig. 10. A linkage generating a straight line 

hi = U 2 + ^2 — 2 rx 2 — — 4:rn — m? — 3r^ = 0 EC^ = (n + 2r)^ + m? 

= {x — r)yi — yxi + ry = 0 F is on DB, 

together with the following set of polynomial inequations D: 

d\ = xi — X ^ 0 B ^ D. 

Selecting m, n, r, and y as parameters of the problem, we want to find the 
relation among m, n, r, y and x. Using the characteristic set method, we have 
found that Zero{H / D) has only one non-degenerate component with the corre- 
sponding ascending chain ASCI- The first polynomial in ASC^ is the relation 
a; -I- 2n -I- r = 0, which tells us that the locus is a line parallel to the y-axis. 

6.2 Parametric Mechanical CAD 

A direct application of AGDC is to mechanical CAD systems. Here the advantage 
over the traditional CAD systems is not limited to a mere user friendly interface. 
In the constraint based system, the user may specify a design figure without 
knowing its shape previously which cannot be done in traditional CAD system. 
The constraint based system will find all the possible shapes of the specified 
system automatically. This is especially important in the phase of conceptual 
design. 

For instance, for the diagram in Fig. 11, if the radius of circle B is known 
then this diagram is easy to draw. If we do not know the radius of this circle 
but instead we know that this circle passes through a known point A, then this 
diagram is not so easy to draw with ruler and compass. This problem is actually 
one of the ten Appolonius’ construction problems [21]. A solution to this problem 
generated by the global propagation method is given below. 

Suppose that 0\ and O 2 are two known circles and A is a known point. 
We need to construct a circle B which is tangent to circles 0\ and O 2 and 
passes through point A. Figure 11 (b) shows the solution given by the global 
propagation method. By a construction rule [21], line UV passes through the 
similarity center of circles Oi and O 2 , i.e., the intersection W of line O 1 O 2 and a 
common tangent line T 1 T 2 of the two circles (which is easy to construct). Let S 
be the intersection of the circle passing through Ti,T 2 ,A and the circle with O 2 
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as center and passing through T 2 , H the intersectioir of hires AW and T 2 S. 
Then O 2 V is perpendicular to HV. Now, we can construct point V. Point U can 
be constructed easily. 




Fig. 11. Appolonius’ construction problem and CAD 



6.3 Linkage Design: A Realization of Kempe’s Linkage 

In Section 6.1, we introduce a linkage to generate a straight line. It is also able 
to design linkages to generate conics [26]. So it is natural to ask: What kinds of 
curves can be drawn with linkages? This problem has been answered theoretically 
by Kempe [40] who proved that any algebraic plane curve can be generated with 
a linkage. With the AGDC techniques reported in Sections 2,3, and 4, we may 
design a linkage automatically that can be used to draw a plane algebraic curve. 
First, we need three kinds of tools. 

1. The Multiplicator showir in Fig. 12 (a) coirsists of similar crossed paral- 
lelograms. Using a multiplicator, we may obtain iirtegral multiples of any angle, 
e.g. aip or bO. 




Fig. 12. Three tools for liirkage constructioir 



B' 




2. The Additor. Joining one multiplicator to another will produce the com- 
binatioir a(p ± b9. This is the mechanism showir in Fig. 12 (b) where the plate 
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BOK with angle /3 is connected rigidly to the bar. Thus we build up a linkage 
to produce LBOK = a(p ± b9 ± j3 . If OB is taken equal to A, the x-coordinate 
of the point S is A x cos{a(p ± 60 ± /3). 

3. The Translator shown in Fig. 12 (c) consists of parallelograms with OB 
pivoted at O. In this linkage, the bar O'B' is always parallel to OB and can be 
moved freely within its limits. 

Let the algebraic plane curve be defined by the following equation 

/(x,y)=0. (5.1) 

We construct a parallelogram (Fig. 13 (a)) such that OA and OB have lengths m 
and n and form angles 9 and ip with the X-axis respectively. P is a point on the 
curve. Its coordinates are then given below: 

X = m X cos 9 + n X cos ip; (5-2) 

y = m X sin 9 + n x sin ip = m x cos ( 7 t /2 — 9) -\- n x cos ( 7 t /2 — p) (5.3) 

Note that the products and powers of cosines can be expressed as the sum 
of cosines. When substituting equations (5.2) and (5.3) into (5.1), we shall have 
a sum of terms of the following form 

k 

f{x, y) = X cos{a^(f ± bi6 ± f3i) -\- C = 0 (5.4) 

where Ai and C are constants; Oi and bi are positive integers; and [3i equals 90, 
or 0. 





Fig. 13. Realization of Kempe’s linkage 



For each term Ai x cos{aip ± 6^0 ± j3i), we can use the multiplicator and 
additor to construct a link OCi such that OCi is of length Ai and forms an 
angle Oip ± bi9 ± f3i with the X-axis. Using the translators, we can construct a 
chain of links OBi, B 1 B 2 , P 2 P 3 , . . ■ , as shown in Fig. 13 (b), such that B\ = C\ 
and OCiBiBi-i is a parallelogram. Therefore, point P„ has x-coordinate: 

X = YJ^^^A^xcos{a-p±bi9±!3{) = f{x,y) - C (by equation (5.4)). 



252 



Xiao-Shan Gao 



If P is moved along the given curve, its coordinates x, y satisfy: f{x,y) = 0. 
Accordingly, the locus of the end point of the chain is 

A + C = 0, 

a straight line parallel to the Y-axis. Conversely, if Bn is moved along this line 
(with the help of a Peaucellier cell, for instance) point P will generate the curve 
f{x,y) = 0. We thus finish the construction. 

After the construction of the chain of points we need to let 

point Bn move on line A -|- C = 0, and for a given position of Bn we find the 
positions of other points by using a numerical method, such as the optimization 
method introduced in Section 3. For examples of Kempe linkages, see [26]. 

6.4 Pose Determination: The PnP Problem 

The Perspective- n-Point Problem (PnP), also known as the Location Determi- 
nation Problem (LDP) [18] or the Exterior Camera Calibration Problem [31], 
originates from camera calibration [18]. It is to determine the exterior parame- 
ters: the position and orientation of the camera with respect to a scene object 
from n points. It concerns many important fields, such as computer animation, 
computer vision, image analysis and automated cartography, robotics, etc. Fis- 
chler and Bolles [18] summarize the problem as follows: 

“ Given the relative spatial locations of n control points Pi, and given 
the angle to every pair of control points from an additional point called 
the Center of Perspective (Cp), find the lengths of the line segments 
joining Cp to each of the control points.” 

Let Xi = \CpPi\, Uij = \PiPj\, and pij = cos{LPiCpPj). Applying the cosine 
theorem to triangles PiCpPj, we obtain the PnP equation system: 

xf + x'^ — 2xiXjPij — afj = 0,{i = 1, . . . ,n;j = i + 1, . . . ,n). 

The study of the PnP problem mainly consists of two aspects: 

Diagram Construction. Design fast and stable algorithms that can be used 
to find all or some of the solutions of the PnP problem. 

Solution Classification. Give a classification for the solutions of the PnP 
equation system, i.e., give explicit conditions under which the system has 
none, one, two, . . . number of physical solutions. 

There are many results for the first problem and the second problem is not solved 
completely. 

All three kinds of AGDC techniques studied in this paper can be used to 
attack this problem. In [27], the synthetic approach and the symbolic computa- 
tional approach are used to study the P3P problem. In [75], symbolic computa- 
tion methods are used to the solution classification problem. 
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In the symbolic computation approach, Wu-Ritt’s zero decomposition algo- 
rithm [68] is used to find a complete solution decomposition for the PSP equation 
system. The decomposition has the following implications. First, it provides a 
complete set of analytical solutions to the PSP problem. Previous work usu- 
ally consider the main solutions and omits many special cases. This might cause 
trouble when the given data is from the special case. Second, by expressing all 
solutions in triangular form it provides a fast and stable way for numerical solu- 
tion. Third, it provides a clear solution space analysis of the PSP problem and 
thus provides a good starting point for multiple solution analysis. 

In the geometric approach, the three perspective angles are considered sepa- 
rately. Then the locus of the control point in each case is a torus and the control 
point is the intersection of the three tori. In this way, we give some pure geo- 
metric criteria for the number of solutions of the PSP problem. One interesting 
result is 

Theorem. The PSP problem can have only one solution if all the three angles 
formed by the three control points are obtuse. 

For the special case ai2 = 013 = 023 = 1 and pi2 = P13, the following solution 
classification for the PSP problem is given in [ 27 ]. For the sake of simplicity, we 
use two new parameters pi2 = r and P23 = P- 




Fig. 14. Solution classification for 012 = 013 = 023 = 1 and pi2 = P13 L\ is 
p = L2 is p = — 1 , and L3 is p = | The number of solutions on 

the curves are omitted in this diagram. 



254 



Xiao-Shan Gao 



1. Point P has four solutions, if and only if 

2. Point P has three solutions, if and only if 



and 






^ <P<\- 



or 



2 ^ ^ t' ^ 2' 

3. Point P has two solutions, if and only if 



\<q<l 



0<g< i 



or 



2 < g < 1 
^<P<1 



or 



^ < g < 1 



5<P<|- ' 



^2 ''t' -2 ^ and p > 2g^ - 1. 

4. Point P has one solution, if and only if 



-^ < q < ^ or ^ < q < ^ 



2g2 - 1 < p < i 



or 



2g^ - 1< P < I - 2^ 



or 



0<g< i 



or 



V2 



<q< 



^/3 



p = 



or 



0<g< i 
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Abstract. This paper proposes a DOF-based graph reduction approach 
to geometric constraint solving. The proposed approach incrementally 
solves a geometric constraint problem that is not ruler-and-compass con- 
structible by incrementally identifying a set of constrained geometric en- 
tities with 3 DOF (degree of freedom) as a rigid body and determining 
the geometric entities in the rigid body using one of the two solving pro- 
cedures: algebraic method and numerical method, instead of solving it 
simultaneously using a numerical method. However, the use of the nu- 
merical method is restricted to solve only those parts that must be solved 
numerically. By combining the advantages of algebraic solving with the 
universality of numerical solving, the proposed method can maximize the 
efficiency, robustness, and extensibility of a geometric constraint solver. 

Keywords: Parametric design, variational design, graph reduction, con- 
structive constraint solving. 



1 Introduction 

Parametric design is an approach to product modeling, which associates engi- 
neering knowledge with geometry and topology in a product design by means of 
geometric constraints [3]. It allows users to make modifications to existing de- 
signs by changing parameter values. For this reason, parametric design has been 
considered an indispensable tool in many applications such as mechanical part 
design, tolerance analysis, simulations, kinematics, and knowledge-based design 
automation: [11,15,21,25,26]. 

Many research efforts have been made toward improving parametric design 
functionality. One of the main efforts is to develop a geometric constraint solver 
that can solve a geometric constraint problem efficiently and robustly. There are 
three major approaches to solving a geometric constraint problem: 1) numerical 
approach, 2) constructive approach, and 3) symbolic approach. 

In the numerical approach, geometric constraints are converted into a system 
of numerical equations: [12,22]. Then, the system of equations is solved simul- 
taneously by an iterative numerical method. This approach can solve a variety 
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of geometric configurations including ruler-and-compass constructible and ruler- 
and-compass non-constructible configurations since any well-constrained prob- 
lem that can be represented as a set of equations can be, in theory, solved by 
numerical techniques. However, along with this advantage come some significant 
shortcomings [17]: 

— Numerical techniques have a number of problems related to numerical sta- 
bility and solution consistency. 

~ The number of iterations required to solve a set of constraint equations can 
vary substantially, depending on initial conditions given to the solver. 

— Numerical techniques are relatively inefficient. 

~ Numerical techniques cannot distinguish between different roots in the solu- 
tion space. 

Due to the limitations of the numerical approach mentioned above, most 
parametric design systems adopt the constructive approach as a fundamental 
scheme for solving geometric constraints. 

In the constructive approach, geometric constraints are represented by a set of 
knowledge such as graphs or predicate symbols: [2,4,7,9,13,14,20,23,27,28,29,30]. 
In this approach, a constraint solver satisfies the constraints by incrementally 
processing the set of knowledge. Usually, the solver takes two phases of geometric 
constraint solving: a planning phase and an execution phase. During the first 
phase, a sequence of construction steps is derived using a graph-based technique 
or a rule-based technique. During the second phase, the sequence of construction 
steps is carried out to determine geometric entities. The main advantage of 
the constructive approach is that it separates the symbolic aspects from the 
numerical aspects so that those usual problems such as numerical instabilities 
associated with the numerical approach can be minimized. 

Many research efforts have been made toward constructive geometric con- 
straint solving. Owen [23] and Bouma et al. [4] presented graph reduction ap- 
proaches to geometric constraint solving. Owen proposed a graph-based con- 
structive solver in which a constraint graph is analyzed for triconnected com- 
ponents. DCM is a commercial constraint solver based on the Owen’s method. 
Bouma et al. proposed a similar approach and, further, extended it to deal with 
more complex configurations. Moreover, Fudos et al. [7] extended Bouma et al.’s 
approach to deal with under-and over-constrained geometric constraint prob- 
lems. Ait-Aoudia et al. [1] proposed a graph reduction method based on the 
bipartite graph analysis that decomposes a well-constrained system into irre- 
ducible ones. Latham et al. [19] proposed an algorithm for processing geometric 
constraints based on the connectivity analysis between constraints and geometric 
entities. These researchers have done excellent work on this approach. However, 
there is still much work to be done. For instance, they might have some strong 
assumptions; 1) the graph of constraints is biconnected [23], 2) it is hierarchi- 
cally reducible into triangles or 3) all geometric entities have exactly 2 degrees 
of freedom (i.e. the radius of all circles must be known): [4,7]. Moreover, there 
may be no effective way of solving each set of irreducible configurations that has 
been decomposed by the graph reduction analysis: [1,19]. 
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Lee and Kim [20] proposed a graph-based rule inferencing method, which can 
overcome an inefficient geometric reasoning process of rule-based inferencing 
methods. Nevertheless, it can only deal with ruler-and-compass constructible 
configurations. Gao et al. [9] proposed a rule-based approach to solving a large set 
of complex geometric configurations but could not overcome inefficient geometric 
reasoning. 

In the symbolic approach, the constraints are translated into a system of 
algebraic equations. The system is then solved with symbolic methods such as 
Wu’s method or Grobner basis computation: [6,10,16]. The method can solve 
general nonlinear systems of algebraic equations but may require exponential 
running time. 

Figure 1 shows two models that require sophisticated solving techniques. The 
triangle in Fig. 1(a) is well constrained, apart from rigid body translation and 
rotation. Though this configuration is seemingly very simple, it is difficult for 
some constructive approaches to solve the constraints since it requires reasonably 
sophisticated ordering of construction steps. The model in Fig. 1(b) cannot be 
solved by current constructive methods since it partially requires a numerical 
solving technique or another technique to determine the geometric entities. These 
examples show that the constructive method alone cannot solve a variety of 
geometric configurations. 




(a) (b) 

Fig. 1. Gonfigurations that require sophisticated solving techniques 



In this paper, we propose a DOF-based graph reduction approach to geomet- 
ric constraint solving. The proposed approach employs a graph-based construc- 
tive approach globally and a numerical approach locally. The constraint solving 
process consists of two phases: (1) planning phase and (2) execution phase. In 
the planning phase, we incrementally identify a set of constrained geometric en- 
tities with 3 DOF (degree of freedom) as a rigid body that may be either ruler- 
and-compass constructible or ruler- and- eompass non- constructible. A sequence 
of construction steps is generated as the result of this graph-based reduction 
process. In the execution phase, each construction step is evaluated sequentially 
to determine the geometric entities in the associated rigid body using one of 
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three solving procedures: (1) algebraic procedure, (2) symbolic procedure, and 
(3) numerical procedure. The algebraic procedure determines the geometric enti- 
ties in the rigid body by calculating their positions and orientations if they are 
ruler-and-compass constructible and the symbolic or numerical procedure if they 
are ruler-and-compass non-constructible. Using the proposed approach, thus, we 
can incrementally solve a geometric constraint problem that is not ruler-and- 
compass constructible, instead of solving it simultaneously using a numerical 
method. However, the use of the numerical method is minimized. By combining 
the advantages of algebraic solving with the universality of numerical solving, 
the proposed method can maximize the efficiency, robustness, and extensibility 
of a geometric constraint solver. 

The remainder of this paper is organized as follows. Section 2 describes an 
overview of the proposed geometric constraint solver. Section 3 presents the con- 
struction plan generation phase. Section 4 describes the plan evaluation phase. 
Section 5 shows implementation results. Section 6 presents a conclusion with 
some remarks. 

2 Overview 

A geometric constraint problem is defined by a geometric model consisting of a 
set of geometric entities and a set of geometric relations, called constraints. In 
this paper, we consider only well-constrained problems and exclude relations on 
dimension variables and inequality constraints. Geometric entities used in the 
paper include points, lines, circles, line segments, and circular arcs. Constraints 
include incidence, distance, angle, parallelism, concentricity, tangency, and per- 
pendicularity. A geometric entity has its own degrees of freedom, which allow 
it to vary in shape, position, size, and orientation. For example, a point and a 
line have two degrees of freedom (DOF), and a circle has three DOF. On the 
other hand, a geometric constraint reduces the DOF of the geometric model by 
a certain number, called the valency of the constraint, depending on the con- 
straint type as shown in Table 1 [2]. In order for a set of geometric entities to be 
constrained fully, all their degrees of freedom must be consumed or taken up by 
geometric constraints. The geometric model can be represented by a constraint 
graph in which nodes are geometric entities, and edges are geometric constraints. 

The proposed constraint solving process consists of two phases: (1) planning 
and (2) execution. In the planning phase, a sequence of construction steps is 
generated by incrementally forming rigid bodies with three DOF (two transla- 
tional, one rotational), called clusters. A rigid body is a set of geometric elements 
whose position and orientation relative to each other are known. At each clus- 
tering step, a rigid body with three degrees of freedom, consisting of geometric 
entities and/or clusters and geometric constraints, is identified and combined 
into a single merged cluster, Ri. This cluster formation process continues until 
the reduced constraint graph becomes a single cluster. In the execution phase, 
each construction step is evaluated to derive positions and orientations of the 
geometric entities in the cluster by selecting an appropriate procedure among 
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Table 1. Geometric constraints and their valency 



Constraint Type 


Associated Geometric Entities 


Valency 




Point, Point 


1 




Point, Line 


1 


Distance 


Point, Circle 


1 




Line, Circle 


1 




Line, Line 


2 


Incidence 


Point, Line 


1 




Point, Circle 


1 


Coincidence 


Point, Point 


2 




Line, Line 


2 


Tangency 


Line, Circle 


1 




Circle, Circle 


1 


Angle 


Line, Line 


1 


Parallelism 


Line, Line 


1 


Concentricity 


Point, Circle 


2 



the three constraint solving procedures described in Section 4, considering the 
clustering type. 

Notations being used throughout the paper are summarized below: 

— Li, Ci, and Pi represent a line, a circle, and a point, respectively. 

— Gi represents a geometric entity (or a cluster) with two DOF. 

— Ri represents a cluster (or a geometric entity) with three DOF. 

3 Plan Generation 

If a geometric constraint model is well-constrained as shown in Fig. 2, a sequence 
of construction plan, as shown in Fig. 3, is generated by two phases; 1) prepro- 
cessing the pairs of adjacent geometric entity nodes constrained by the geometric 
constraints with two DOF as shown in Fig. 4, and 2) forming rigid bodies that 
have one of the clustering types shown in Fig. 5. Each set of nodes/edges in 
Fig. 5 is reduced into a cluster with 3 DOF. The preprocessing and clustering 
procedures are described in more detail below. 



3.1 Processing the Constraint Graph 

Most of geometric constraints take up one DOF, but there are some constraints 
that consume two DOF, as shown in Table 1 and Fig. 4. A distance dimension 
between two lines specifies both parallelism and distance so that it takes up 
two degrees of freedom. A coincidence constraint between two points also takes 
up two degrees of freedom, as does a concentricity constraint. These geometric 
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Fig. 2. A simple design and its constraint graph 



Step Clustering 



Reduced gr^h Step Clustering Reduced graph 




Fig. 3. Clustering steps for the design shown in Fig. 2 



D 



L, 



l2 




Distance u L2 D 



Concentric Cj P 



(a) 



(b) 



Fig. 4. Constraints that reduce two degrees of freedom 
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(c) 





(d) 



Fig. 5. Clustering types: (a), (b): ruler-and-compass constructible; (ci), ( 02 ): 
extended ruler-and-compass constructible; and (C3), (d): ruler-and-compass non- 
constructible 



constraints and their associated geometric entities are combined into a special 
type of clusters with 2 DOF. In the proposed approach, a cluster with 2 DOF is 
treated as a pseudo geometric entity. During the preprocessing, thus, the set of 
a geometric constraint with 2 valency and its two associated geometric entities 
is identified and combined into a (pseudo) geometric entity as shown at step 0 
in Fig. 3. 

3.2 Cluster Forming 

Each set of nodes and edges shown in Fig. 5 forms a rigid body with three 
DOF. In this incremental clustering procedure, a rigid body with three DOF 
is identified and merged into a reduced cluster. By repeating these identifying 
and merging procedures, the constraint graph may be reduced to a single cluster 
node as shown in Fig. 3. The clustering process is classified into three types: 
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(1) ruler-and-compass constructible or RCC, (2) extended ruler-and-compass 
constructible or ERCC, and (3) ruler-and-compass non-constructible or RCNC. 
An appropriate solving procedure is developed for each of the clustering types 
as shown in Table 2 



Table 2. Solving procedures according to clustering types 



Clustering Types 


Type Descriptions 


Related Graphs 
in Fig. 5 


Solving 

Procedures 


One connecting edge 


Three G nodes 


a 


A 


Two connecting edges 


One G node and one R node 


b 


A 




Three R nodes 


d 


C 




One geometric entity 
constrained by three constraints 


Cl 


B 


Three connecting edges 


One geometric entity 
constrained by two constraints 


C2 


B 




Each geometric entity 
constrained by one constraint 


C3 


C 



The geometric configurations shown in Fig. 5(a) and 5(b) are ruler-and- 
compass constructible. Thus, they can be efficiently determined by an algebraic 
solving procedure [20]. The geometric configuration shown in Fig. 5(c) is not 
ruler-and-compass constructible. To solve this configuration effectively, the clus- 
ter formation process is further classified into three types, according to the rela- 
tions between geometric entities in two clusters: (1) one-to-three, (2) one-to-two, 
and (3) one-to-one, as shown in Fig. 5(ci), 5(c2), and 5(ca), respectively^. 

One-to-three and one-to-two type configurations are effectively solved by 
an extended ruler-and-compass method, whereas one-to-one type clusters are 
not. For example, the geometric configurations shown in Fig. 6(a) and 6(b) 
are extended ruler-and-compass constructible. The one-to-one type configuration 
shown in Fig. 7 is solved effectively by a numerical procedure. 

4 Plan Execution 

Fach construction step is evaluated to derive the relative positions and orien- 
tations of geometric entities in the cluster (or rigid body) by executing an ap- 
propriate solving procedure described below. A ruler-and-compass constructible 

^ A, B, and C represent procedures for solving ruler-and-compass constructible 
(RCC), extended ruler-and-compass constructible (ERCC), and ruler-and-compass 
non-constructible (RCNC) configurations, respectively. 

^ In Fig. 5, Gi represents a geometric entity or cluster with two degrees of freedom. Ri 
represents a cluster or geometric entity with three degrees of freedom. Rij represents 
a geometric entity in the cluster Ri. 
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Fig. 6. Extended ruler-and-compass constructible: (a) one-to-three, (b) one-to- 
two 



configuration is determined by an algebraic solving procedure: [20,21]. This pro- 
cedure sequentially calculates the position and orientation of each undetermined 
geometric entity within the cluster by solving the algebraic equations associated 
with the constraints. An extended ruler-and-compass constructible is solved by 
an extended algebraic solving procedure. This procedure determines the geo- 
metric entities by finding a sequence of rotations and translations to satisfy the 
geometric constraints. A ruler-and-compass non-constructible cluster is solved by 
a symbolic or a numerical method according to its topological structure. These 
solving methods are explained below. 



4.1 Solving the RCC Clusters 

For the configuration shown in Fig. 5(a), we calculate the position and/or orien- 
tation of each geometric entity Gi, relative to the other entities within the rigid 
body, by solving algebraically the equations associated with the constraints (that 
is, evaluating inferred rules, see [20]). For the configuration shown in Fig. 5(b), 
we compute the position and orientation of the geometric entity G\, relative to 
the cluster i?i , by solving the algebraic equations associated with the constraints 
between geometric entities in G\ and i?i. 

4.2 Solving the ERCC Clusters 

Considering the geometric entities and their relations in the clusters, an appro- 
priate procedure is developed for each type of the extended ruler-and-compass 
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Fig. 7. Ruler-and-compass non-constructible 



constmctible clusters. Each procedure specifies a sequence of rotation & transla- 
tion operations that transforms one cluster R\ relative to the other cluster i?2 to 
satisfy the geometric constraints. As an example, the procedure for the extended 
ruler-and-compass constmctible cluster shown in Fig. 6(b) is summarized as fol- 
lows. In the figure, (i) P2 and P5 in i?i, (ii) L3, Pi, and Pq in R2, and (iii) P2 is 
connected to P\ and L3. 

PROCEDURE ONE_TO_TWO(Pi(P2, P5), ^2(^3, Pi, Po)) 

INPUT: two clusters Pi and P2 

OUTPUT: a merged cluster P consisting of Pi and P2 
L = line(P2,normal(direction(L3)))^; 

IPi = intersect(L3, L); 

translate (i?2, vector-differ(P2, IP))', 

Cl = circle(P2, D3); 

IP2 = intersect(L3, Ci); 

translate (i?2, vector-differ(/P2, Pi))', 

D =distance(P2, Po); 

C2 = circle(P2, P); 

C3 = circle(P5, D2)', 

IP3 = intersect (C2, U3); 

A = angle (vector-differ(Po - P2), vector-differ(/P3, P2)); 
translate(P2, -P2); 
rotate(P2, -A); 
translate (P2, P2); 

END_PROCEDURE 

® A line is assumed to be defined by its direction vector (normal(direction(I/ 3 ))) and 
a point on the line (P 2 ). normal(direction(L 3 )) represents a vector perpendicular to 
direction(i 3 ). 
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4.3 Solving the RCNC Clusters 

An efficient procedure is developed to solve the ruler-and-compass non con- 
structible clusters. For the geometric configuration shown in Fig. 5(03), the con- 
straint problem is solved by finding a transformation matrix that represents the 
relation between two clusters R\ and i?2- An iterative method based on the 
Newton method is used to calculate a parameter, 9 , for rotation and two pa- 
rameters, dx and dy, for translation that define a 3x3 transformation matrix. 
This transformation matrix is used to position the cluster i?2 (or R \ ) relative to 
the base cluster R\ (or R2) so that the geometric constraints between the two 
clusters are satisfied. The values of the three parameters for the transformation 
matrix can be computed by using the iterative Newton’s method given by 

J^^+l F(X^) ■ (J(X^))-^ 



where the vector X, function F{X), and Jacobian matrix J{X) are defined as 
follows 



A = 



( dx I F(X) 

\dyj 



f2{X) J{X) 

\MX)J 



dfi(X) 


dfi(X) 


dfi(X) 


as 


adx 


ddy 


dh{X) 


df2(X) 


dh(X) 


as 


adx 


ddy 


dfsiX) 


dh(X) 


dhiX) 


as 


adx 


ddy 



A numerical solving procedure can also handle the 3-cluster configuration 
shown in Fig. 5(d). Since three clusters are involved, six functions for the New- 
ton method are necessary to be defined to compute two transformation matrices 
for i?2 and R3, relative to i?i. The proposed procedure improves the reliability 
of the numerical approach by reducing the number of equations to be solved. For 
the 2-cluster merging shown in Fig. 5 (03), for example, we solve three equations 
to determine the geometric entities by computing a transformation matrix in this 
procedure, instead of 12 equations in conventional numerical methods where 6 
geometric entities or 12 DOF are involved. Thus, we can minimize the numerical 
instability of a numerical solver. As explained above, the one-to-one type cluster 
can be solved effectively by a numerical method. Among these clusters, however, 
the clusters with the same configuration as that shown in Fig. 8 can be effec- 
tively solved by a symbolic approach such as Grobner basis computation [4]. 
Eliminating some variables with Grobner ’s base computation generates a 4th 
order polynomial equation. This equation can be easily solved algebraically. The 
difference between the two clusters shown in Fig. 7 and Fig. 8 lies in the con- 
straint relation in each cluster. The configuration shown in Fig. 8 has a cyclic 
relation among geometric entities in each cluster. On the other hand, the con- 
figuration in Fig. 7 has no such a relation. 



4.4 Extending Clustering Types 

The clusters shown in Fig. 5 can solve a large set of geometric configurations 
including RGNG. However, it is still needed to extend the solver since it cannot 
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Fig. 8. One-to-one type cluster with a cyclic relation 



deal with some complex configurations if they require more than three clusters 
to satisfy their geometric constraints. Figure 9 shows such an example [9]. The 
solver cannot continue the reduction operations because there is no rigid single 
loop in the reduced graph. However, the whole graph is well-constrained so that 
it can be solved. In this case, the solver takes another type of the rigid loop 
identification procedure called multi- circuited loop identification. This procedure 
takes several loop identifications and reductions until it finds a rigid subgraph. 
The identified rigid subgraph in this procedure is called a multi-circuited cluster. 
For the example in Fig. 9, the solver first finds a loop consisting of i?i, i? 2 , 
and Lq, and then reduces it as a cluster Ci. At this time, Ci is under-constrained. 
After then, it finds another cluster C 2 consisting of Ci and R 3 . Note that C 2 is 
over-constrained. This compensates for the free DOF remaining in Ci such that 
the 2-circuited cluster is well-constrained. Finally, the 2-circuited cluster can be 
solved algebraically or numerically according to the geometric structure. 




(a) 



(b) 



Fig. 9. Solver extension: 2-circuited cluster identification 
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5 Implementation 

The proposed geometric constraint solving procedures have been implemented 
in Visual C++ on Windows NT. Figure 10 shows a snapshot of the developed 
constraint solver. Figure 11~14 shows some implementation results. Figure 14 
shows a mechanical part and its modified one that are modeled by using the 
feature-based parametric modeling system [20] . 




Fig. 10. A snapshot of the developed constraint solver 





Fig. 11. A model with one-to-one type of a RCNC cluster and its modification 
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Fig. 12. A model with a multi-circuited cluster and its modification 





Fig. 13. A model of the type shown in Fig. 5(d) and its modification 




Fig. 14. 3D feature-based parametric modeling: a hinge design and its modifi- 
cation 
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6 Discussions 

We have presented a new DOF-based graph reduction approach to geometric 
constraint solving. The proposed approach employs a graph-based constructive 
approach globally and a numerical approach locally. The use of the numerical ap- 
proach is restricted to solve only those clusters that must be solved numerically. 
By combining the advantages of the graph-based constructive approach with the 
universality of the numerical approach, the proposed method can maximize the 
efficiency and robustness of a geometric constraint solver. 

Note that our approach is also based on graph reduction, similar to Bouma et 
al.’s approach [4]. However, our method is different from their method in graph 
reduction (or cluster formation) process. While their graph reduction method 
is based on analyzing node-sharing among clusters, our reduction method is 
based on analyzing edge-sharing among clusters and their degrees of freedom. 
This allows us to deal consistently with a larger set of ruler-and-compass non- 
constructible configurations, as shown in Fig. 5 and Fig. 9. For example, Bouma 
et al.’s method can not handle a geometric configuration having a circle with a 
variable radius that can be seen in Appollonius’ drawing problems and multi- 
circuited clusters as shown in Fig. 12. Our approach is very simple in concept, 
but realizes efficient geometric constraint solving. 

Some of the future works are listed below: 

— For multi-circuited clusters, currently, most of them are solved numerically. 
However, some of them can be algebraically [9]. We are extending the solver 
to match multi-circuited clusters with rule graphs for algebraic solution 
based on the previous research work [17]. 

— Although we have used a modified Newton-Raphson method with global 
convergence property to improve the reliability of the numerical constraint 
solving procedure [24], further work is still needed. It may be considered to 
adopt the homotopy method [18] for numerical solving. 

— Although we have only dealt with well-constrained problems, it is worth 
while to handle under-constrained problems. The proposed approach works 
well with over-constrained and well-constrained problems. However, it has 
difficulties when dealing with under-constrained problems because there ap- 
pears to be no reliable way to locally add constraints to transform an under- 
constrained problem to a well-constrained one [8] . Thus, the local reduction 
analysis is insufficient for under-constrained problems, and a global analysis 
is needed. We are trying to deal with under-constrained problems by globally 
analyzing the cluster configurations derived by the graph reduction analysis: 
[1,19]. 
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Abstract. This paper deals with modeling and problem solving of vari- 
ant geometry in mechanical computer aided design. Enhanced undirected 
graph is used as a schema for modeling a constraint geometry system. 
Two important techniques called equivalent line segment method and 
separable entity group approach are introduced in this paper. These tech- 
niques developed by the authors are used to unify and simplify variant 
geometry problem solving. 



1 Variant Geometry Problems in MCAD 

Variant geometry deals with flexible geometry systems in which the geometry 
entities are subjected to variant geometric constraints. A variant geometry prob- 
lem can be formulated as 

[geometry entities, constraints]. 

Variant geometry has varieties of applications in mechanical computer aided 
design (MCAD) [4,1,11,2,3,9,10]. 

A. Parametric and feature CAD 

More than seventy percent of practical designs are adaptive or innovative de- 
signs including series product design, adapted to new working condition design, 
extending and adding new function design, etc. All of these designs usually begin 
with or imitate to a prototype design; they share some common features (topol- 
ogy and/or shape) but formed with some variant differences. To effectively deal 
with these variant designs, a new generation of design approach called paramet- 
ric modeling and/or feature-based modeling has been developed. A new design 
can be generated by changing specific parameters and/or by combining appro- 
priate predefined features in an intelligent CAD system. The typical application 
cases include: 

2D parametric drafting. System entities are adjustable and changeable drawing 
primitives: line, arc, circle, etc. The subjected constraints are either explicit or 
implicit. Explicit constraints are annotated in drawing like dimensional annota- 
tions. Implicit constraints are defined by default drawing conventions like some 
lines keeping horizontal orientation, some lines keeping vertical orientation, some 
related entities tangent to each other. 
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Parametric (also feature-based) solid modeling. System entities are adjustable 
and changeable geometric feature objects such as faces, edges, vertices, etc. The 
constraints are distance dimension constraints, relative position constraints and 
other relations. 

B. Evaluation and Synthesis of Kinematics of Mechanisms 

A mechanism is also a system of geometry objects subjected to some con- 
straints related to each other. Geometry objects are components and component- 
like subassemblies. The constraints are rotary connection, sliding connection, 
specified kinematics feature limitation including fixed to frame or specified kine- 
matics control, etc. 

Keeping a mechanism conforming to its given kinematics constraints and se- 
quentially changing the position of its driving component, the variant geometry 
processing system will generate history positions of the whole mechanism; this 
is the simulation procedure for the mechanism. Designers can easily try differ- 
ent variances of a mechanism and evaluate the different kinematical behaviors 
instantly. This is very useful for optimization of mechanism design. 

All the geometry applications are intelligent processes based on automated 
deduction in geometry. The processes include the following steps: automatically 
establishing the variant model; transforming the constraint conditions into pro- 
cessable (calculating and reasoning) relations; generating and evaluating the 
variant system by solving the relations. The main issues of a variant geometry 
processing are the modeling method and solving approach. Techniques presented 
below are developed by the authors for variant processing of parametric drafting 
and variant processing of planar linkage mechanism simulation. 

2 Model Graph 

A variant geometry system is a system composed of a number of geometry enti- 
ties related to each other with variant constraint relations. The system structure 
can be modeled with an undirected graph [9,10,8]. Nodes of a model graph are 
geometric entities of a variant geometry system, and edges connecting the nodes 
in the graph represent the constraint relations between the corresponding enti- 
ties. A model graph relates the data within a system and is the structure skeleton 
in this system database. 

For a planar linkage mechanism, the model graph can be express as 
mechanism=Graph(i?, L, C). 

In the above expression, B is the collection of links, B = Bi, B 2 , ■■■, Bm, where m 
is the number of links; L is the collection of constraints, L = Li, L 2 , ■■■, Ln, 
where n is the number of constraints; G is the constraint relations, 

where i? is a rotary connection, Pj, Pk are pivot points. S' is a sliding connection, 
and Pu, Pv and P^ are points on a sliding surface. Figure 1 shows a planar linkage 
mechanism and its model graph, the model graph can also be expressed as Graph 
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{Bi, B2, B3, B4, S5, Bq}, {Li, L2, A3, A4, A5, Lq, L7}, Li{R, B2{Pi)), 

A2(i?,A2(P2),A3(P2)), A3(i?, 53 (^ 3 ), 54 (^ 3 )), A4(i?,A4(P4),A5(P4)), 

L 5 (P,P 5 (P 5 ),Pi(P 5 )), A 6 (P,P 4 (P 6 ),P 6 (P 6 )), Lr{S, Be{Pr), Bi{Ps, Pg)) . 





Fig. 1 Linkage mechanism and its model graph 

The number of geometry entities in an engineering drawing is much more than 
in a mechanism. The constraint relations in a parametric drawing are also more 
complicated than in a mechanism. We design an enhanced graph structure for 
modeling the system of a variant drawing. Figure 2 shows an example of paramet- 
ric drawing, which is composed of 12 geometry entities subjected to dimensional 
and other constraints. Fig. 3 shows the graph data structure for the drawing. 
The relation and constraint conditions (tangency, symmetry, co-cyclic, concen- 
tric) are organized as special link list structures for convenient search and extract 
in processing, the data of key points for featuring entities are also included in 
the data structure. 

Users can define and modify a constraint system interactively; the system 
will update the model graph automatically. 

3 Equivalent Line Segment Entity Method 

The basic types of geometry entities in engineering drawing are line, arc and 
circle. A line segment usually is defined with two end points. Every point has two 
coordinate parameters. So, the number of degrees of freedom for a line segment 
is 4. A circle can be defined with one center point and one radius parameter, so 
the number of degrees of freedom for a circle is 3. An arc can be defined with one 
center point, one radius parameter, and two independent parameters to locate 
the start and end points of this arc, so the number of degrees of freedom for 
an arc is 5. To effectively deal with different types of entities, a new method 
called equivalent line segment entity method has been developed [9, 5, 6, 8]. With 
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this method, all line, arc and circle entities can be transformed into equivalent 
entities that are line segments or combination of line segments (Fig. 4). 

The equivalent entity of a line is the line segment itself. 

The equivalent entity of a circle is a line segment with length equal to the 
radius of the circle and one end point coinciding with the center point of the 
circle; the other end point is movable and the orientation of this line is not 
constrained. 

The equivalent entity of an arc is a combination of two equivalent line seg- 
ments. These two equivalent line segments are drawn from the center of the arc 
to two end points of the arc respectively. 





Fig. 2b Drawing after parametric process 
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Dimension KeyPoints Entities Constraint Relations 



Tangent Tangent Symmetry 




Fig. 3 Graph data structure for the drawing in Fig. 2 



All the above equivalent entities have the same number of degrees of freedom and 
the same feature points as their original entities. During the variant processing, 
an entity is treated as an object in object-oriented programming. Every entity 
has two forms: original form and equivalent form. These two forms can be inter- 
transformed from one form into another form based on the common feature 
points. The feature point data and the inter-transformation operation procedures 
are encapsulated within entity objects. 




Fig. 4 Line, circle, arc and their equivalent entities 

Figure 5 shows some drawing examples of equivalent entity transformation. A 
circle and a tangent line were transformed into two equivalent lines contacted at 
tangent point. Two tangential circles were transformed into two equivalent lines 
meeting at tangent point. A contour composed of lines and arcs was transformed 
into a contour composed of lines only. The standard form equation for a circle 
or an arc is a quadratic polynomial equation; and the equation for a line is a 
linear equation. 
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Fig. 5 Drawing examples of equivalent entity transformation 

When a circle or an arc maintains tangent or intersection relations with other 
entities, it is quite cumbersome to process these relations based on quadratic 
equations. In the above cases, replacing connected circles or arcs with equivalent 
lines drawn from the center point to related feature points (tangent points, inter- 
section points, etc.) will lead to simplification of establishing relation equation 
procedures. Furthermore, the equations are all linear equations. Transforming 
arbitrary drawings composed of different types of geometry entities into draw- 
ings composed of unified equivalent line segments will unify the data structure, 
regulate the relation equations and significantly simplify the variant processing. 

4 Separable Entity Group Approach 

All the geometric parameters of a geometric constraint system should be con- 
formed to the constraint relations. The whole number of unknown geometric 
parameters for a solvable constraint system should be equal to the number of 
total constraint equations, if the system is well defined. Usually, a variant pro- 
cessing for a constraint system involves a solving procedure of a large number 
of simultaneous conditions. Several intelligent approaches have been developed 
for solving this constraint problem, including iterative numerical method, geom- 
etry reasoning method and graphic constructive method [4,1,11,2]. However, the 
difficulty of solving a large number of simultaneous equations exists in all vari- 
ant processing systems. Divide and conquer is an effective strategy in practice 
for solving large size problems and has been used in different variant process- 
ing systems with different implementations. An approach called separable entity 
group approach is described in the following, which is also developed based on 
the divide and conquer strategy. A large variant system will be decomposed to 
several separable entity groups step by step [9,10,7,8]. A separable entity group 
is a subsystem of a constraint system, which can be solved independently. The 
separability criterion of an entity group is that the total number of unknown 
geometric parameters of this group is equal to the number of equations defined 
by the related constraint conditions inside this group. 

All the geometry entities and constraint relations have been schematically 
represented in a model graph. A separable entity group is a linked node cluster 
satisfying the separability criterion in a model graph. The basic principle for 
extracting separable entity group is searching the possible linked node cluster 
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in the model graph and checking the cluster with separability criterion. The 
searching procedure is iterative during the variant processing. There might be 
multiple-choice of separable entity groups in processing, we prefer that the sepa- 
rable entity groups with least number of entities (least number of nodes) should 
be extracted and processed first. 

The procedure of extracting separable entity groups is incorporated in the 
variant processing procedure; the main points of the procedure are as follows 

1. Establish two entity lists: the defined entity list and undefined entity list. 
An entity is defined if all the necessary parameters for defining this entity 
are known. 

2. Interpret and transform system constraint conditions into equations. Estab- 
lish system constraint equation list. 

3. Sequentially check each equation in system constraint equation list. If there 
is an equation including only one unknown variable; then solve it, update 
entity lists and eliminate this equation from system equation list. 

4. Set the limited number of entities in separable entity group NE = 1. 

5. Establish new working (candidate) entity group. Select an entity from unde- 
fined entity list which is adjacent (related) to a defined entity. Set this entity 
as first entity in working entity group. 

6. Check the working entity group with separability criterion. If the criterion 
is satisfied, go to step 15. 

7. Add a new undefined entity to the working entity group, the new undefined 
entity should be adjacent (related) to the entities of working entity group. 
It is better if the new entity is also adjacent to a defined entity. 

8. Set NW equal to the number of entities in working entity group. 

9. If NW is not greater than NE, then go to step 6. 

10. If NW is greater than NE and the first entity in working entity group is 
not the last entity in undefined entity list, then go to step 5 but try to set 
other undefined entity as first entity in working entity group. 

11. If NW is greater than NE and all the entities of undefined entity list had 
been tried as first entity in working entity group, then set NE = NE + 1. 

12. If if is not greater than the number of undefined entities, then go to step 
5. 

13. If A" if is greater than the number of undefined entities, then stop and output 
message: “the variant system is under-constrained.” 

14. If there is no new entity added to defined entity list after a whole iterative 
cycle of variant processing procedure, then stop. 

15. Extract the working entity group as a separable entity group; solve the con- 
straint equations; and update defined entity list and undefined entity list. 

16. If undefined entity list is empty, then stop. 

17. Go to step 3. 

The running result of linkage mechanism showed in Fig. 1 will be as follows 

1. Assume specifying link 1 as the frame (fixed) link and link 5 as the driving 
link. The system will update system database and add link 1 and link 5 into 
defined entity list. 
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2. Extract link cluster 46 in the model graph as the first separable entity group. 

3. Extract link cluster 32 in the model graph as another separable entity group. 

The running result of drawing showed in Fig. 2 might be as follows 

1. Assume specifying point PA as fixed point and specifying the coordinates 
of point PA. 

2. Solve linear dimension equations and get the coordinates of point PB and 
point P9. 

3. Extract entity circle No. 9 as separable entity group. 

4. Extract entity circle No. A as separable entity group. 

5. Extract entity circle No. B as separable entity group. 

6. Extract entity circle No. C as separable entity group. 

7. Extract entity cluster (arc No. 1-line No. 2-arc No. 3-line No. 4-arc No. 5-line 
No. 6-arc No. 7-line No. 8) as separable entity group. 

Usually, a candidate for separable entity group is an entity cluster linked be- 
tween defined entity nodes in a model graph. The proposed approach is efficient 
since the extracting processing is performed by directly searching and checking 
the separability criterion along the relation path in a system model graph. For 
further increasing the efficiency, the following content-oriented features can be 
used as rules of thumb for guiding the extracting processing. 

— If entities in a node chain form a contour in a drawing, this node chain is 
probably a separable entity group. 

— If entities in a node cluster form an independent geometric part in a geometry 
object, this node cluster is also probably a separable entity group. 

— The entities linked between driving components (there might be multiple 
driving sources) and fixed component is a good beginning choice as a basic 
cluster for further searching in kinematics analysis. 

5 Implementation 

The CAD and CG Research Institute of Dalian University of Technology already 
developed a prototype expert system running on MS-Windows environment for 
parametric drafting and simulation of linkage mechanism based on the above 
approach. The module for simulation of linkage mechanism was implemented by 
using PROLOG language, and the parametric drafting module was implemented 
by using C-I--I- language. The prototype system co-works with a conventional 
CAD system. The system architecture is showed in Fig. 6. Figure 7 shows a 
working display of linkage mechanism simulation and Fig. 8 shows a working 
display of parametric drafting. The implementation verified the correctness and 
effectiveness of the above method and approach. 
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Fig. 6 System architecture 




Fig. 7 Working display of linkage mechanism simulation 

6 Conclusion 

Variant geometry has a very important variety of applications in MCAD, ranging 
from geometry design to kinematics in 2D and 3D. Parametric, feature-based 
modeling design and variant kinematic analysis are applications of automated 
deduction in geometry. 

An enhanced graph structure has been designed as structure skeleton for 
modeling variant geometry system. 

Keeping the same number of degrees of freedom and same feature points, all 
the line, arc, and circle entities can be transformed into equivalent line segments. 
With the equivalent entity method, a complicated drawing composed of different 
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types of entities can be replaced with an equivalent drawing composed of unified 
equivalent line segments only in further variant processing. 

A separable entity group is a separable subsystem of a variant system, which 
can be processed separately. Based on the separable entity group approach, the 
variant process of a large variant system can be decomposed to the sequential 
variant processes of small size subsystems step by step. 

The techniques of equivalent entity method and separable entity group ap- 
proach are very useful in variant problem solving. These techniques can also be 
applied with other intelligent approaches in variant process. 




Fig. 8 Working display of parametric drafting 
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